REMARKS

Reconsideration and withdrawal of the objection to and rejections of the claims, in view 
of the remarks herein, is respectfully requested. Claims 1-3 are amended. Claims 1-23 and 25- 
46 are now pending in this application. 

Claim 17 was objected to because the claim includes non-elected subject matter, i.e., 
SEQ ID Nos:47, 48, 49, 66-71, and 73-80. As Applicant timely traversed the Restriction 
Requirement, Applicant retains the right to request that the Commissioner review the propriety 
of the Restriction Requirement. Therefore, no amendment to claim 17 is required at this time. 

The 35 U.S.C. § 112, Second Paragraph, Rejection
Claims 1, 4-11, 17-19, 25, 30-31, 35-37, and 41-44 were rejected under 35 U.S.C. § 112,
second paragraph, as being indefinite for failing to particularly point out and distinctly claim the
subject matter which Applicant regards as the invention. The amendment to claim 1 obviates the
§ 112(2) rejection. Accordingly, withdrawal of the § 112(2) rejection is respectfully requested.

The 35 U.S.C. § 112, First Paragraph, Rejection
Claims 1-11, 15-16, 18-20, 25, 30-32, 34-37, and 41-44 were rejected under 35 U.S.C. §
112, first paragraph, as failing to comply with the written description requirement. This 
rejection, as it may be maintained with respect to the pending claims, is respectfully traversed. 

The Office Action asserts that the recitation of "protein destabilization sequence", 
"mRNA destabilization sequence", "luciferase", and "reporter protein" fails to provide a 
sufficient description of the claimed genus of proteins, as it merely describes the functional 
features of the genus without providing any definition of the structural features of the species 
within the genus, citing for support Univ. Calif. v. Eli Lilly and Co., 119 F.3d 1559, 43
U.S.P.Q.2d 1398 (Fed. Cir. 1997). 

The Office Action continues, asserting that the specification only describes the 
polynucleotide having the nucleic acid sequence of SEQ ID NO:72, which encodes a fusion 
polypeptide comprising a specific luciferase isolated from firefly, a specific PEST sequence, and 
a specific CL1, and that does not constitute a representative number of species to describe
polynucleotides encoding a fusion polypeptide comprising a whole genus of variants,
recombinant and mutants of any or all reporter proteins or luciferases, any or all protein and/or
mRNA destabilization sequences or any or all PEST sequences.

AMENDMENT AND RESPONSE UNDER 37 C.F.R. § 1.111
Serial Number: 10/664,341
Filing Date: September 16, 2003
Title: RAPIDLY DEGRADED REPORTER FUSION PROTEINS
Dkt: 341.021US1

As amended, the claims recite particular reporter proteins (which have a particular 
function that is associated with a particular structure) or a mRNA destabilization (function)
sequence structure. The claims also recite structure for the protein destabilization (function)
sequence, e.g., SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID
NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, or a 
PEST sequence, i.e., a sequence enriched in proline, glutamic acid, serine, and/or threonine 
residues. Thus, the properties (attributes) that identify the genus of nucleic acid molecules are 
that the sequences in the nucleic acid molecules are those for a particular reporter protein, and a 
particular protein destabilization sequence and/or RNA destabilization sequence, or 
combinations thereof. Underlying each of these properties is a structure. That is, not every
nucleotide sequence encodes a reporter protein; not every nucleotide sequence present in a larger 
nucleotide sequence, once translated to a protein, confers a decreased half-life upon the 
translated protein; and not every nucleotide sequence present in a larger nucleotide sequence, 
once transcribed, confers a decreased stability to the resulting mRNA. 

Applicant need not describe what is known to the art. Hybritech, Inc. v. Monoclonal 
Antibodies, Inc., 231 U.S.P.Q. 81, 94-95 (Fed. Cir. 1986). M.P.E.P. § 2163(II)(A)(3)(b) states
that the description need only describe in detail that which is new or not conventional. 

The documents submitted with the Amendment filed on March 29, 2007, as well as the
references cited against the claims under 35 U.S.C. § 103(a), clearly evidence that sequences for
reporter proteins, protein destabilization sequences and mRNA destabilization sequences were
known to the art (see Andreatta et al., Biotechniques, 30:656 (2001), Leclerc et al.,
Biotechniques, 29:590 (2000), Gilon et al., EMBO J., 17:2759 (1998), King et al., Mol. Biol.
Cell, 7:1343 (1996), Rechsteiner et al., Sem. Cell. Biol., 1:433 (1990), Fan et al., Genes and
Devel., 11:2557 (1997), Balmer et al., Mol. Cell. Biol., 21:2070 (2001), and Belanger et al., Soc.
Neurosci. Abs., 26:41 17 (2000); all of record). An explanation as to why these documents and
references are not sufficient to show that sequences for reporter proteins, protein destabilization
sequences and mRNA destabilization sequences were known to the art has not been provided.
See M.P.E.P. § 2163.04(II).

Further evidence that sequences for the recited reporter proteins were known to the art
prior to Applicant's filing is provided by, e.g., Bouthors et al. (Protein Eng., 12:313 (1999)),
Lorenz et al. (Proc. Natl. Acad. Sci. USA, 88:4438 (1991)), Matsumura et al. (Nat. Biotech.,
17:696 (1999)), Murray et al. (J. Mol. Biol., 254:993 (1995)), Sirot et al. (Antimicr. Agents
Chemo., 41:1322 (1997)), Voladri et al. (J. Bacteriol., 178:7248 (1996)), Zhang et al. (Proc. Natl.
Acad. Sci. USA, 94:4504 (1997)), and Wood (Science, 244:700 (1989)) (a copy of each is
enclosed herewith).

Moreover, the specification exemplifies three different firefly luciferase sequences 
(structure) (SEQ ID NOs: 48, 49, and 66), two different click beetle luciferase sequences 
(structure) (SEQ ID NOs: 77 and 79), a Renilla luciferase sequence (structure) (SEQ ID NO:47), 
a green fluorescent protein sequence (structure) (SEQ ID NO:68), as well as other reporters (see
page 4 of the specification), numerous protein destabilization sequences and sources of other 
protein destabilization sequences (see pages 5 and 23 in the specification, e.g., a PEST sequence, 
for example, a PEST sequence from cyclin, e.g., mitotic cyclins, uracil permease or ODC, a 
sequence from the C-terminal region of a short-lived protein such as ODC, early response 
proteins such as cytokines, lymphokines, protooncogenes, e.g., c-myc or c-fos, MyoD, HMG 
CoA reductase, S-adenosyl methionine decarboxylase, CL sequences, a cyclin destruction box, 
N-degron, or a protein or a fragment thereof which is ubiquitinated in vivo), and numerous RNA 
destabilization sequences and sources for other such sequences (see page 6 of the specification, 
e.g., a sequence which likely forms a stem-loop, one or more AUUUA or UUAUUUAUU 
sequences, and including the 3' UTR of the bradykinin B1 receptor gene).
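
The AU-rich motifs named above are short enough to be located by a simple string search. The following is a minimal Python sketch of such a search, offered only as an illustration and not as a method taken from the specification. The function name `find_motifs` and the example RNA string are hypothetical, and only the two motifs quoted above (AUUUA and UUAUUUAUU) are searched.

```python
import re

# The two AU-rich motifs quoted in the text above.
AU_MOTIFS = ("UUAUUUAUU", "AUUUA")

def find_motifs(rna, motifs=AU_MOTIFS):
    """Return {motif: [0-based start positions]} for each motif.

    A zero-width lookahead is used so that overlapping
    occurrences are all reported.
    """
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    hits = {}
    for motif in motifs:
        hits[motif] = [m.start() for m in re.finditer(f"(?={motif})", rna)]
    return hits

# Hypothetical sequence invented for this illustration only.
example = "GGCAUUUAUUUAUUUACC"
for motif, positions in find_motifs(example).items():
    print(motif, positions)
```

A search of this kind only flags candidate motifs; whether a given sequence actually destabilizes an mRNA remains an experimental question.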

Therefore, more than one species of reporter protein, luciferase, protein destabilization 
sequence, and RNA destabilization sequence has been disclosed in the specification, contrary to 
the assertion in the Office Action that the specification only describes the polynucleotide having 
the nucleic acid sequence of SEQ ID NO:72, which encodes a fusion polypeptide comprising a 
specific luciferase isolated from firefly, a specific PEST sequence, and a specific CL1. 

Thus, one of skill in the art can quite clearly envision the structure of the encoded fusion 
proteins in view of the species disclosed in Applicant's specification. 

With regard to Univ. Calif. v. Eli Lilly and Co., 119 F.3d 1559, 43 U.S.P.Q.2d 1398 (Fed.
Cir. 1997), the specification at issue did not disclose a nucleotide sequence for the claimed
human cDNA and disclosed only one rat cDNA sequence for the claimed vertebrate or
mammalian cDNAs.

In contrast to the claims at issue in Eli Lilly, Applicant discloses numerous nucleotide
sequences falling within the scope of the claimed genus, as discussed above. 

In the Amendment mailed on November 15, 2007, Applicant pointed out that the terms at 
issue in the present application are not new or unknown biological material that the skilled 
artisan could easily miscomprehend. In Amgen v. Hoechst Marion Roussel, 314 F.3d 1313, 65
U.S.P.Q.2d at 1398 (Fed. Cir. 2003), the Federal Circuit pointed out that in Enzo Biochem. v.
Gen-Probe, Inc., 296 F.3d at 1324, 63 U.S.P.Q.2d at 1613 (Fed. Cir. 2002), it was clarified that
Eli Lilly did not hold that all functional descriptions of genetic material necessarily fail as a 
matter of law to meet the written description requirement; rather, the requirement may be 
satisfied if in the knowledge of the art the disclosed function is sufficiently correlated to a 
particular, known structure. The Federal Circuit continued, stating that both Eli Lilly and Enzo
Biochem. are inapposite to Amgen because the claim terms at issue ("vertebrate" and 
"mammalian") were not new or unknown biological materials that ordinarily skilled artisans 
would easily miscomprehend because they readily "convey distinguishing information 
concerning [their] identity" such that one of ordinary skill in the art could "visualize or recognize 
the identity of the members of the genus." Eli Lilly, 119 F.3d at 1567, 1568, 43 U.S.P.Q.2d at
1406. 

The words or phrases "luciferase," "fluorescent protein," "chloramphenicol 
acetyltransferase," "beta-glucuronidase," "beta-galactosidase," "a PEST sequence" and "a 
mRNA destabilization sequence that is AU rich or forms a stem-loop" readily convey 
distinguishing information concerning their identity such that one of ordinary skill in the art can 
visualize or recognize the identity of the members of the genus. 

Accordingly, withdrawal of the § 112(1) "written description" rejection is respectfully
requested. 

The 35 U.S.C. § 103 Rejection
Claims 1-11, 15-20, 25, 30-32, 34-37, and 41-44 were rejected under 35 U.S.C. § 103(a)
as being unpatentable over Leclerc et al. (BioTechniques, 29:590 (2000)), Corish et al. (Protein
Eng., 12:1035 (1999)), Gilon et al. (EMBO J., 17:2759 (1998)) and Kastelic et al. (WO
00/39314). This rejection is respectfully traversed.

Leclerc et al. prepared a construct in which a coding sequence for a firefly luciferase was
linked to a C-terminal murine ornithine decarboxylase (mODC) coding sequence that includes a 
PEST sequence found near the C-terminus of wild-type mODC. It is disclosed that the PEST 
sequence in mODC corresponds to residues 423-450, and that residues 423-461 of mODC 
(modified by an amino acid substitution at two positions), i.e., the C-terminal residues of mODC, 
were fused to firefly luciferase sequences (see Figure 1). 

Leclerc et al. report that the half-life of the firefly luciferase with the PEST sequence
was 0.84 hours. It is disclosed that the construct allows for detection of rapid increases or
decreases in gene expression (abstract), such as those which are transient dynamic changes or in 
response to gene inhibiting or activating secretagogues (page 600). The authors note that other 
factors such as mRNA stability and processing influence reporter activity. 

Corish et al. disclose constructs encoding green fluorescent protein (GFP) linked at the
C-terminus to a 27 amino acid sequence from murine ODC that contains a PEST sequence or
linked at the N-terminus to a 116 residue fragment from cyclin B1 that contains a destruction box
(CDB), or both. Notably, the presence of both CDB and ODC sequences (at the N-terminus and
at the C-terminus, respectively) resulted in a GFP protein having a half-life that was substantially
the same (i.e., no additive or complementing effect) as the GFP protein with only the CDB
sequence (see Figure 2). At page 1037 in Corish et al., it is disclosed that "[t]he addition of the
PEST motif to this CDB-GFP protein marginally reduced the average half-life further to 5.5
hours. . . . Thus, addition of the PEST region does not appear to make a significant difference to
fluorescence when protein levels are non-limiting" (emphasis added).

Therefore, it appears that the presence of the CDB sequence, which leads to
ubiquitination at the N-terminus of GFP, dominates the half-life of the CDB-GFP-ODC fusion
protein.

Gilon et al. report on the isolation of a set of protein degradation signals for ubiquitin 
based proteolysis in yeast. 

Kastelic et al. disclose a construct having a coding region for luciferase linked to a 
mRNA instability sequence (clone # 63) and its use in a method to screen for compounds that 
promote the instability of mRNAs with mRNA instability sequences. The effect of the mRNA 
instability sequence on luciferase expression in the results shown in Figure 4 is not particularly
apparent until 24 hours after differentiation is induced, although the overall signal of clone # 63 
is 3- to 5-fold less than the control clone (clone # 53, which has a luciferase construct but no 
mRNA instability sequence). With reference to this data, it is disclosed that one "would expect 
in the case of luciferase to see a weaker effect of mRNA destabilizing drugs since transcription 
remains high" (page 12). 

Treatment of the clones with an agent known to promote the instability of mRNAs (a 
radicicol analog, i.e., SDZ 216-732) resulted in a decrease in activity in clone #63 over time 
(Figure 5 in Kastelic et al.). Treatment of the clones with actinomycin D (an agent that binds 
DNA at the transcription initiation complex and prevents elongation) resulted in a decrease in 
activity for both clones but relatively less of a reduction for clone # 63, while treatment of the 
clones with cycloheximide (an agent that inhibits protein synthesis) resulted in a decrease in 
activity for clone # 53 and an increase in activity for clone # 63 (Figure 5).

If there are fewer luciferase mRNAs available for translation in clone # 63 cells as a 
result of the mRNA instability sequence, it is unclear why clone # 63 has more activity than 
clone # 53 in the presence of actinomycin D or cycloheximide at 8 hours. Moreover, the reduced 
activity for clone # 63 in the presence of SDZ 216-732 relative to clone # 53 at 8 hours in Figure 
5 may be due to the reduced signal for clone # 63 (see Figure 4). 

In view of the disclosure in Corish et al., it was unexpected that different protein 
destabilization sequences could have substantially cumulative effects. 

Moreover, prior to Applicant's disclosure, it was unknown if two different protein 
destabilization sequences at the C-terminus of a reporter protein would result in a reporter 
protein with a half-life that was different than a reporter protein with one of the protein 
destabilization sequences, particularly in view of Corish et al., where the half-life of the
destabilized protein was almost exclusively due to a protein destabilization sequence at the
N-terminus.

With regard to combining protein and mRNA destabilization sequences, the purpose of 
the two types of destabilization sequences is different. As transcription precedes RNA 
processing and RNA processing precedes translation, a mRNA destabilization sequence 
decreases the amount of translatable mRNA, which in turn decreases the amount of translated 
protein, but the half-life of the translated protein is unaffected. For constructs with a mRNA 
destabilization sequence, this allows for screening of agents that promote mRNA destabilization
(see Kastelic et al.). The presence of a protein destabilization sequence in a translated protein
targets the protein for more rapid degradation than a corresponding protein without the protein 
destabilization sequence. A construct with a protein destabilization sequence would not be 
useful in screening for agents that promote mRNA destabilization. 
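
The distinction drawn above between the two types of destabilization sequence can be illustrated with a minimal two-stage kinetic sketch. It assumes a textbook first-order model that is not taken from the specification or from the cited references: mRNA is produced at a constant rate and decays with its own half-life, and protein is translated in proportion to the mRNA level and decays with its own half-life. All rate constants, half-lives, and names below are hypothetical.

```python
import math

def steady_state_levels(k_tx, mrna_half_life_h, k_tl, protein_half_life_h):
    """Return (mRNA, protein) steady-state levels for a first-order model.

    k_tx: transcription rate (mRNA per hour), hypothetical units
    k_tl: translation rate (protein per mRNA per hour), hypothetical units
    """
    k_dm = math.log(2) / mrna_half_life_h     # mRNA decay constant (1/h)
    k_dp = math.log(2) / protein_half_life_h  # protein decay constant (1/h)
    mrna = k_tx / k_dm
    protein = k_tl * mrna / k_dp
    return mrna, protein

# Hypothetical numbers chosen only to show the direction of each effect.
baseline = steady_state_levels(10.0, 4.0, 5.0, 3.0)
mrna_destabilized = steady_state_levels(10.0, 1.0, 5.0, 3.0)     # shorter mRNA half-life
protein_destabilized = steady_state_levels(10.0, 4.0, 5.0, 0.75)  # shorter protein half-life

for label, (mrna, protein) in [
    ("baseline", baseline),
    ("mRNA destabilized", mrna_destabilized),
    ("protein destabilized", protein_destabilized),
]:
    print(f"{label}: mRNA={mrna:.1f} protein={protein:.1f}")
```

In this simplified model, shortening the mRNA half-life lowers both the mRNA and the protein level while the protein half-life is unchanged, whereas shortening the protein half-life lowers only the protein level.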

Moreover, the signal for a construct with a mRNA destabilization sequence was 3- to 5- 
fold less than a construct without the mRNA destabilization sequence (see Figure 4 in Kastelic et 
al.). Thus, prior to Applicant's disclosure, one of skill in the art would not consider combining 
RNA and protein destabilization sequences with a reporter gene, as the reduction in available 
reporter protein mRNA transcripts that can be translated in conjunction with a reduction in 
reporter protein half-life may not yield sufficient reporter protein for any particular assay. 

In this regard, the Examiner is requested to consider that the rapid and/or significantly 
larger increase in degradation in luciferase-CL-PEST or luciferase-protein destabilization 
sequence-mRNA destabilization sequence constructs was unexpected (Figures 9, 10B, 15 and 16 
in Applicant's specification). 

Accordingly, Applicant respectfully requests withdrawal of the § 103 rejection. 

CONCLUSION

Applicant respectfully submits that the claims are in condition for allowance and 
notification to that effect is earnestly requested. The Examiner is invited to telephone 
Applicant's attorney ((612) 373-6959) to facilitate prosecution of this application. 

If necessary, please charge any additional fees or credit overpayment to Deposit Account 
No. 19-0743. 



Respectfully submitted, 

SCHWEGMAN, LUNDBERG & WOESSNER, P.A. 
Attorneys for Intel Corporation 
P.O. Box 2938 

Minneapolis, Minnesota 55402 
(612) 373-6959 

The antibiotic fusidic acid and certain closely related steroidal compounds
are potent competitive inhibitors of the type I variant of chloramphenicol
acetyltransferase (CATI). In the absence of crystallographic data for CATI,
the structural determinants of steroid binding were identified by (1)
construction in vitro of genes encoding chimaeric enzymes containing
segments of CATI and the related type III variant (CATIII) and (2)
site-directed mutagenesis of the gene encoding CATIII, followed by kinetic
characterisation of the substituted variants. Replacement of four residues
of CATIII (Gln92, Asn146, Tyr168 and Ile172) by their equivalents from
CATI yields an enzyme variant that is susceptible to competitive inhibition
by fusidate with respect to chloramphenicol (Ki = 5.4 µM). The structure of
the complex of fusidate and the Q92C/N146F/Y168F/I172V variant,
determined at 2.2 Å resolution by X-ray crystallography, reveals the
inhibitor bound deep within the chloramphenicol binding site and in close
proximity to the side-chain of His195, an essential catalytic residue. The
aromatic side-chain of Phe146 provides a critical hydrophobic surface
which interacts with non-polar substituents of the steroid. The remaining
three substitutions act in concert both to maintain the appropriate
orientation of Phe146 and via additional interactions with the bound
inhibitor. The substitution of Gln92 by Cys eliminates a critical hydrogen
bond interaction which constrains a surface loop (residues 137 to 142) of
wild-type CATIII which must move in order for fusidate to bind to the
enzyme. Only two hydrogen bonds are observed in the CAT-fusidate
complex, involving the 3-α-hydroxyl of the A-ring and both the hydroxyl of
Tyr25 and NE2 of His195, both of which are also involved in hydrogen
bonds with substrate in the CATIII-chloramphenicol complex. In the acetyl
transfer reaction catalysed by CAT, NE2 of His195 serves as a general base
in the abstraction of a proton from the 3-hydroxyl of chloramphenicol as
the first chemical step in catalysis. The structure of the CAT-inhibitor
complex suggests that deprotonation of the 3-α-hydroxyl of bound fusidate
by this mechanism could produce an oxyanion nucleophile analogous to
that seen with chloramphenicol, but one which is incorrectly positioned to
attack the thioester carbonyl of acetyl-CoA, accounting for the observed
failure of CAT to acetylate fusidate.

Introduction

Steroid hormones and related steroidal molecules
are known to bind to a range of macromolecular
targets in eukaryotes including nuclear receptors,
intracellular enzymes involved in steroid biosynthesis
or metabolism, and sex-hormone and corticosteroid
binding globulins present in plasma
(Parker, 1993). Although the structure of the
complex between progesterone and an anti-steroid
Fab' fragment (Arevalo et al., 1993), and unliganded
structures of ketosteroid isomerase (Westbrook
et al., 1984), uteroglobin (Morize et al., 1987), and a
bacterial β-hydroxysteroid dehydrogenase (Ghosh
et al., 1991), have been determined by X-ray
crystallography, the breadth of the spectrum of
structural motifs for steroid binding by proteins
remains unknown. In fact, the sole example of the
structural determination of the complex of a
steroidal ligand and an in vivo protein target is that
recently determined for dehydroisoandrosterone
bound to a bacterial cholesterol oxidase (Li et al.,
1993), a flavin-dependent enzyme believed to have
a role in the degradation of 3-β-hydroxysteroids,
permitting the bacterium to utilise cholesterol as a
sole source of carbon and energy.

It is well documented that the type I variant
(CATI) of the enzyme chloramphenicol acetyltransferase
(CAT; EC 2.3.1.28), a common effector of
bacterial resistance to the antibiotic chloramphenicol
(Shaw, 1967), can also confer resistance to the
steroidal antibiotic fusidic acid when expressed in
an Escherichia coli mutant which, unusually for
Gram-negative bacteria, is fusidate-sensitive (Bennett
& Shaw, 1983). Resistance to chloramphenicol
is achieved via the enzyme-catalysed transfer of an
acetyl group from acetyl-CoA to the primary
hydroxyl group of the substrate, yielding
3-acetylchloramphenicol which fails to bind to, and
thus no longer inhibits, the peptidyl-transferase
centre of prokaryotic ribosomes. Fusidic acid is also
an inhibitor of bacterial protein synthesis but is
known to act at the elongation phase via stabilisation
of the complex of elongation factor G and GDP
(reviewed by Cundliffe, 1981). As binding by CATI
leads neither to acetylation nor to any other covalent
modification of fusidate, it is likely that the
resistance mechanism is simply one of sequestering
the antibiotic from its target (Bennett & Shaw, 1983).

Genes encoding CAT have been isolated from
many bacterial genera and nucleotide sequences of
some 30 different natural variants are known,
demonstrating 25 to 98% amino acid identity in
pairwise comparisons. None are homologous in
primary structure to known steroid-binding proteins,
nor, with the exception of the acetyltransferase
component of the pyruvate dehydrogenase
complex which has the same tertiary fold as CAT
(Mattevi et al., 1992), are they known to be related
structurally to other proteins of any type. All CAT
variants are homotrimers (3 × 25 kDa) with three
active sites located in the clefts at the subunit
interfaces (Shaw & Leslie, 1991) and operate via a
rapid-equilibrium ternary complex mechanism with
random order of addition of substrates (Kleanthous
& Shaw, 1984). The imidazole of His195 acts as a
general base to abstract a proton from the primary
hydroxyl of chloramphenicol, followed by attack of
the resulting oxygen nucleophile at the carbonyl of
the acetyl-CoA thioester. The resulting tetrahedral
oxyanion intermediate, stabilised by hydrogen-bonding
with the hydroxyl of Ser148, then collapses
to yield the products 3-acetylchloramphenicol and
CoA. Such a mechanism is supported by the results
of chemical modification experiments (Kleanthous
et al., 1985) as well as X-ray crystallographic (Leslie,
1990) and site-directed mutagenesis (Lewendon
et al., 1990, 1994) studies of the type III variant of
CAT (CATIII) (Murray et al., 1988). In contrast,
fusidate resistance appears to be a unique property

Figure 2. Sequence alignment of CATIII and CATI. The primary sequences of CATIII and CATI were optimally aligned
and numbered as described by Shaw & Leslie (1991). Secondary structural motifs (helices and strands) observed in the
crystal structure of CATIII (Leslie, 1990), and the homologous regions of CATI, are underlined. The 17 residues which
form the chloramphenicol binding site are shown in bold text. Positions of residue identity are indicated by * and those
amino acids which are conserved in all known CAT variants by #. Double arrows above the sequence indicate the
crossover points exploited in the construction of chimaeric CAT variants.



of the CATI enzyme, encoded by the transposon
Tn9 and many different enterobacterial R-plasmids.
Fusidate binding to CATI is competitive with
respect to chloramphenicol, despite the lack of
obvious structural equivalence between the two
ligands (Figure 1), and extends to various fusidate
derivatives and related steroidal molecules (Bennett
& Shaw, 1983).

An understanding of the determinants of binding
of ligands which are not substrate analogues to
proteins is a prerequisite to rational drug design
and to the production of novel enzymes by protein
engineering. In the case of steroid binding by CATI,
such an objective could in principle be achieved by
determination of the structure of the enzyme/fusidate
complex by X-ray crystallography. However,
a prolonged and exhaustive effort has failed to
yield crystals of CATI with suitable diffraction
properties. Efforts to model the binding of fusidate
to CATI using the known structure of the complex
of CATIII and chloramphenicol (Leslie, 1990; M.J.S.,
unpublished experiments) have been frustrated by
two obstacles: a lack of residue conservation
between CATI and CATIII (46% identity), even in
the substrate binding sites, and the fact that a
chloramphenicol site is located deep within each of
the three inter-subunit clefts and could, in principle,
be occluded (rather than occupied) by fusidate
bound on the trimer surface. Random mutagenesis
of the genes encoding either CATI or CATIII was
considered inappropriate due to the absence of a
direct or facile selection for fusidate-sensitive
mutants of the former, and the consideration that
single point mutations were unlikely to confer
fusidate resistance upon E. coli expressing CATIII.
However, starting with the assumption that the
tertiary folds of CATI and CATIII were likely to be
essentially the same, and hence that steroid binding
affinity is conferred primarily as a consequence of
differences in amino acid side-chains, it seemed
reasonable that site-directed mutagenesis of CATIII
could be useful in mapping the fusidate binding
site. The desired result was achieved by the
construction in vitro of recombinants expressing
chimaeric CAT enzymes, followed by directed
mutagenesis to produce variants of CATIII which
bind fusidic acid with high affinity and are
amenable to crystallographic analysis.
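
The percentage identity quoted above for the two variants is the kind of figure obtained by counting matching positions in a pairwise alignment. The following is a minimal Python sketch of that calculation. It assumes one common convention (identical residues divided by the number of aligned, ungapped columns), which may differ from the convention used by the authors. The function name and the two toy strings are hypothetical and are not CAT sequences.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # gapped columns are excluded under this convention
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared

# Toy aligned strings invented for illustration only.
toy_a = "ACDE-FGH"
toy_b = "ACDQWFGK"
print(f"{percent_identity(toy_a, toy_b):.1f}% identity")
```

Other conventions divide by the length of the shorter sequence or by the full alignment length, so a quoted identity is only comparable across studies when the denominator is stated.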

Results and Discussion 

Production and characterisation of CATI/CATIII
chimaeric enzymes

Steady state kinetic analysis of the inhibition of
wild-type CATI and CATIII by sodium fusidate
indicated that the steroidal inhibitor is bound
~200-fold more strongly by the former (Ki = 1.5 µM
and 279 µM, respectively). In each case binding is
competitive with respect to the substrate chloramphenicol.
However, because chloramphenicol binds
in each of three deep clefts on the enzyme surface
of CATIII (and presumably CATI also) it cannot be
presumed a priori that fusidate binding is determined
solely by the amino acid side-chains which
form the substrate binding pocket. Calculations
(using a sphere of radius 1.4 Å) indicated that in the
CATIII-chloramphenicol complex (Leslie, 1990) only
one chlorine atom and the two oxygen atoms of the
p-nitro substituent are solvent-accessible. Thus,
binding of fusidate to non-conserved surface
residues flanking the entrance to the chloramphenicol
site (residues 26 to 30, 137 to 142 and 163 to 167;
Figure 2) might in principle result in competitive
inhibition by simply precluding access of substrate
to the catalytic centre of CATI. Of the 17 amino acid
side-chains which form the chloramphenicol binding
site of CATIII, only nine are retained in CATI of
which four are conserved in all CAT variants
(Figure 2; Zhao & Aoki, 1992). As a first step in
delineating regions of CATI which contained
determinants of high affinity steroid binding, we
constructed a series of recombinant CAT genes
encoding chimaeric enzymes.
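
The practical meaning of the two Ki values quoted above can be seen with the standard rate equation for competitive inhibition, v = Vmax[S] / (Km(1 + [I]/Ki) + [S]). The following is a minimal Python sketch, not the authors' analysis. The Ki values are those quoted in the text; the substrate and fusidate concentrations, the shared Km of 11 µM, and the normalised Vmax are assumptions chosen for illustration.

```python
def competitive_rate(s_um, i_um, vmax, km_um, ki_um):
    """Michaelis-Menten rate in the presence of a competitive inhibitor."""
    km_apparent = km_um * (1.0 + i_um / ki_um)  # inhibitor raises apparent Km
    return vmax * s_um / (km_apparent + s_um)

# Ki values for sodium fusidate quoted in the text (µM).
KI_FUSIDATE_UM = {"CATI": 1.5, "CATIII": 279.0}
fold_difference = KI_FUSIDATE_UM["CATIII"] / KI_FUSIDATE_UM["CATI"]
print(f"Ki ratio (CATIII / CATI): {fold_difference:.0f}-fold")

# Assumed assay conditions, for illustration only.
SUBSTRATE_UM = 10.0
FUSIDATE_UM = 15.0
ASSUMED_KM_UM = 11.0

for variant, ki in KI_FUSIDATE_UM.items():
    free = competitive_rate(SUBSTRATE_UM, 0.0, 1.0, ASSUMED_KM_UM, ki)
    inhibited = competitive_rate(SUBSTRATE_UM, FUSIDATE_UM, 1.0, ASSUMED_KM_UM, ki)
    print(f"{variant}: {100.0 * inhibited / free:.0f}% activity remaining")
```

The ratio of the two quoted constants is 186, consistent with the approximately 200-fold difference stated in the text.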

Two crossover points were selected for construction
of CATI/CATIII chimaeras at locations which
divide the CAT open reading frames approximately
into thirds. The first crossover point, in a short 3₁₀
helix on the enzyme surface between residues 72
and 73, was chosen because this site is known to
tolerate insertions of additional amino acid residues
while retaining activity (Betz & Sadler, 1981; I.A.M.,
unpublished experiments). The second was located
between two highly conserved surface residues
(Pro151 and Trp152) which form part of a reverse
turn between β-strands G and H. The fact that the
two CAT variants are 46% identical in sequence and
readily form heterotrimers of CATI and CATIII
subunits that are catalytically competent (Packman
& Shaw, 1981; Day et al., 1995) raised the possibility
that some of the chimaeric enzymes might be active,
notwithstanding the potential for folding defects
and/or inappropriate intra- and inter-monomer
interactions. Of six possible chimaeric proteins
encoded by crossovers at these positions, only five
produced CAT protein, four of which were able to
acetylate chloramphenicol and three were characterised
in respect of inhibition by sodium fusidate.
Enzymes in which residues 6 to 71 of CATIII
replaced residues 1 to 71 of CATI and in which
residues 152 to 221 of CATI replaced residues 152 to
219 of CATIII (the III-I-I and III-III-I chimaeras)
were purified by standard affinity chromatography
methods and their steady state kinetic parameters
for acetyl transfer to chloramphenicol and inhibition
by fusidate were determined (Table 1). The former
is inhibited by fusidate (Ki = 2.0 µM) almost as
effectively as is wild-type CATI (1.5 µM), whereas
the latter is ~twofold less sensitive to fusidate



Table 1. Steady state kinetic parameters determined for
wild-type and chimaeric CAT variants

Variant              kcat (s^-1)   Km Cm (µM)   Ki FA (µM)
Wild-type CATI       97 (a)        11 (a)       1.5
Wild-type CATIII     599 (b)       11.6 (b)     279
III-I-I              9.3           19           2.0
III-III-I            191           65           505
III (I139-143)       534           17           193

III-I-I, Chimaeric CAT variant wherein residues 1 to 71 of CATI
were replaced by residues 6 to 71 of CATIII.

III-III-I, Chimaeric CAT variant wherein residues 152 to 219 of
CATIII were replaced by residues 152 to 221 of CATI.

III (I139-143), Chimaeric CAT variant wherein loop 139 to 143
(VTPEN) of CATIII is replaced by the equivalent loop (FIEN) of
CATI.

Kinetic parameters were determined as described in Materials
and Methods. Parameters are the mean of a minimum of three
independent determinations, and standard error values are 15%
(or less) of the quoted values. Km values for the binding of the
second substrate, acetyl-CoA, are within threefold of the values
of the wild-type enzymes in the cases of all CAT variants
analysed in the present study (data not shown).

(a) Data from Murray et al. (1991b).

(b) Data from Lewendon et al. (1988).


(Ki = 505 µM) than wild-type CATIII. A third
chimaera (III-I-III) wherein residues 72 to 151 of
CATIII are replaced by their equivalents from CATI
could only be assayed in crude cell-free extracts
by means of a very sensitive radiometric assay
(Gorman et al., 1982) but did display fusidate
sensitivity intermediate to that of the two wild-type
enzymes assayed under similar conditions (data not
shown). Taken together, such data suggest that the
principal determinants of high affinity fusidate
binding in CATI cannot reside in the N-terminal
third of the protein and, additionally, that the
C-terminal third of CATI is insufficient by itself to
confer CATI-like affinity for fusidate, only binding
the ligand in the context of residues 72 to 151 of that
variant. It therefore seemed likely that differences
in steroid affinity are a consequence
of substitutions of some of those residues (92, 105,
146, 168 and 172) known to be variable in the
chloramphenicol binding site and/or in the surface
loops 137 to 142 and 163 to 167. However,
replacement of residues 139 to 143 (Val-Thr-Pro-Glu-Asn)
of CATIII by the shorter loop (Phe-Ile-Glu-Asn)
from CATI yields an enzyme with kinetic
parameters that are almost identical to those of
wild-type CATIII (Table 1). Therefore, site-directed
mutagenesis of each of the eight non-conserved
residues in the chloramphenicol binding site of
CATIII was used in an attempt to further define
fusidate binding residues.

Site-directed mutagenesis of residues of the
chloramphenicol binding site of CATIII

Eight of the 17 amino acid side-chains which form
the chloramphenicol binding site of CATIII are
substituted (Xaa) in CATI: Phe24(Ala), Tyr25(Phe),
Leu29(Ala), Gln92(Cys), Ala105(Ser), Asn146(Phe),
Tyr168(Phe) and Ile172(Val). The Y25F mutant was
Table 2. Steady state kinetic parameters determined for
singly and multiply substituted CATIII variants

Variant                             kcat (s-1)   Km Cm (µM)   Ki FA (µM)

A.
Wild-type CATIII                    599 a        11.6 a       279
F24A/L29A                           500          33           133
Y25F                                258 b        15 b         111
N146F                               690          42           >2500
Y168F/I172V                         358          38           255

B.
Q92C/N146F                          243          20           44
Q92C/N146F/Y168F/I172V              377          20           5.4
F24A/L29A/Q92C/N146F/Y168F/I172V    112          48           3.8
Wild-type CATI                      97 c         11 c         1.5

a Data from Lewendon et al. (1988).
b Data from Murray et al. (1991a).
c Data from Murray et al. (1991b).



prepared and analysed by steady state kinetic
methods in an earlier study (Murray et al., 1991a).
The substitutions Q92C, A105S and N146F were
each introduced into CATIII as individual point
mutations, whereas the F24A/L29A and Y168F/I172V
double mutants were prepared using a single
round of mutagenesis taking advantage of the close
proximity of the codons of both pairs of residues in
the CAT structural gene. The steady state kinetic
parameters for chloramphenicol acetylation and
inhibition by sodium fusidate for four such variants
are presented in Table 2A. Data were not
determined for the Cys92 and Ser105 enzymes
because preliminary experiments were indicative of
a fusidate sensitivity very similar to that of
wild-type CATIII in each case. With the exception of
the N146F variant, each of the residue substitutions
has only a minor influence on steroid binding.
However, the substitution of Asn146 by phenylalanine
effectively abolishes competitive binding of
fusidate by CATIII. Indeed, whereas inhibition of
the N146F enzyme is seen at very high concentrations
of inhibitor (>2.5 mM) it is impossible to
distinguish between the effect of inhibitor binding
to CAT and probable indirect effects of the
detergent-like fusidate molecule on the partitioning
of chloramphenicol between enzyme and solvent
water (data not shown). Loss of affinity for the
inhibitor accompanying the N146F mutation does
not appear to be a consequence of any global
changes in the integrity of the enzyme as judged by
the near-wild-type kinetic properties of N146F in
the acetyl transfer reaction. Therefore, and in spite
of the fact that the N146F substitution has an effect
opposite to that predicted, it seemed plausible that
Phe146 might have an important role in binding of
the inhibitor. In the structure of the binary complex
of CATIII and chloramphenicol, the side-chain of
Asn146 is the centre of an extended hydrogen bond
network which includes the amide side-chain of
Gln92 and the phenolic hydroxyl of Tyr168 (Figure 3).
As each of these hydrogen bond acceptors
and donors is substituted by more hydrophobic
amino acids (Cys92, Phe146, Phe168) in CATI it
seemed likely that concerted substitution of all
three residues might be necessary to produce a
CATI-like fusidate binding phenotype. To that end
we constructed and purified the Q92C/N146F
double and Q92C/N146F/Y168F/I172V quadruple
mutants of CATIII and determined their kinetic
parameters (Table 2B). Introduction of Cys92 in the
context of Phe146 enhances binding >50-fold
compared to N146F, yielding an enzyme with
sixfold greater fusidate (Ki = 44 µM) affinity than
that of wild-type CATIII. When these two substitutions
are combined with those of Y168F and I172V
the fusidate inhibition constant (Ki) falls to 5.4 µM,
a 50-fold enhancement in affinity over that of
wild-type CATIII and approaching that of the type
I variant. Further modification via the F24A and
L29A substitutions results in only a modest increase
in binding affinity (Ki = 3.8 µM) supporting the
conclusion of the chimaeric enzyme experiments
that residues of the N-terminal segment of CATI
play a limited role in steroid binding.
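
The fold changes quoted in this section follow directly from the Ki values in Table 2, and they can be restated as approximate differences in binding free energy. The sketch below is illustrative only. It assumes that Ki can be treated as a dissociation constant and uses the 25°C assay temperature given in Materials and Methods; the function names are hypothetical, and the N146F entry is a lower bound (>2500 µM), so any value derived from it is also a bound.

```python
import math

R_J_PER_MOL_K = 8.314   # gas constant
T_KELVIN = 298.15       # 25 degrees C, the assay temperature in Materials and Methods

# Fusidate Ki values (µM) taken from Table 2. N146F is reported as >2500,
# so 2500 is used here only as a lower bound.
KI_UM = {
    "wild-type CATIII": 279.0,
    "N146F": 2500.0,
    "Q92C/N146F": 44.0,
    "Q92C/N146F/Y168F/I172V": 5.4,
    "F24A/L29A/Q92C/N146F/Y168F/I172V": 3.8,
    "wild-type CATI": 1.5,
}


def fold_tighter(reference, variant):
    """Return how many times lower the variant Ki is than the reference Ki."""
    return KI_UM[reference] / KI_UM[variant]


def ddg_kj_per_mol(reference, variant):
    """Approximate ddG = RT ln(Ki_variant / Ki_reference); negative means tighter binding.

    Assumption: Ki is treated as a true dissociation constant.
    """
    ratio = KI_UM[variant] / KI_UM[reference]
    return R_J_PER_MOL_K * T_KELVIN * math.log(ratio) / 1000.0


if __name__ == "__main__":
    for name in ("Q92C/N146F", "Q92C/N146F/Y168F/I172V",
                 "F24A/L29A/Q92C/N146F/Y168F/I172V"):
        print(name,
              round(fold_tighter("wild-type CATIII", name), 1),
              round(ddg_kj_per_mol("wild-type CATIII", name), 1))
```

On this reading, each tenfold drop in Ki corresponds to roughly 5.7 kJ/mol at 25°C, which gives a sense of scale for the stepwise gains produced by the successive substitutions.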

Crystal structure of the
fusidate-CATIII(Q92C/N146F/Y168F/I172V)
complex

The structure of the Q92C/N146F/Y168F/I172V
variant of CATIII was determined at 2.2 Å resolution
in the presence of bound fusidate (Table 3).
With the exception of the substituted residues and the 
different bound ligands the structure is essentially 




Figure 3. Hydrogen bond interactions in the complex of
wild-type CATIII and chloramphenicol. NE2 of His195, a
critical catalytic residue, acts as a general base to abstract
the hydroxyl proton of chloramphenicol as the initial
chemical step of the acetyl-transfer reaction. Residues
Gln92, Asn146 and Tyr168, which form an extended
hydrogen bond network on one side of the substrate
binding site (Leslie, 1990), are each replaced by
hydrophobic amino acid side-chains in CATI.




Figure 4. Fusidate bound in the chloramphenicol binding site of the Q92C/N146F/Y168F/I172V variant of CATIII.
The solvent-accessible surface (sphere radius 1.5 Å) was calculated for all amino acid side-chains which contained atoms
positioned within 6 Å of atoms of the fusidate ligand. The four rings of the steroidal ligand are bound in a roughly
cylindrical hydrophobic cavity with the A-ring at the base and D-ring closest to the surface of the trimer. The
hydrophobic tail of the inhibitor projects out of the binding site onto the enzyme surface.



identical to that of wild-type CATIII in complex
with chloramphenicol (Leslie, 1990). The RMS
differences between the structures of wild-type and
quadruply substituted variants are 0.20 Å and
0.53 Å for main-chain and side-chain atoms,
respectively. Two residues, Val139 and Thr140, are
disordered in the structure and were deleted from
the model. Such disorder reflects the loss of a
stabilising hydrogen bond interaction between the
side chains of Gln92 and Thr140 (which accompanies
the Q92C substitution) and is also observed in
the complex between the quadruple mutant and
chloramphenicol (A.G.W.L., unpublished data). In
addition the side-chain of His144 occupies two
distinct positions due to alternate χ1 angles of +60°
(c.f. wild-type CATIII) and -70°. As the peptide
bond of His144 shows the highest deviation from
planarity (12°) of any residue in wild-type CATIII,
the alternate conformer of this side-chain may
reflect the loss of some conformational constraint
following substitution of Asn146 by phenylalanine
in the mutant.

The side-chains of Cys92, Phe146, Phe168 and
Val172 and the atoms of the four ring systems of the
fusidate molecule are well ordered in the CAT-inhibitor
complex. The steroid A-ring is positioned at
the base of the chloramphenicol binding pocket in
close proximity to the imidazole side-chain of
His195, an essential catalytic residue (Lewendon
et al., 1994). The B, C and D rings also occupy the
chloramphenicol site whereas the remainder of
the inhibitor projects onto the enzyme surface and
is more accessible to solvent. Atoms of the
hydrophobic "tail", the O-acetyl, and carboxylate
substituents of the inhibitor (Figure 1) are
disordered. In spite of the apparent mobility of the
latter it may contribute to fusidate binding via
electrostatic interaction with the guanidinium of
Arg28, a side-chain which occupies clearly defined
density in this structure but is disordered beyond
CB in crystals of wild-type CATIII.

As might be expected for a hydrophobic steroidal
ligand, extensive non-polar interactions are observed
in the CAT-fusidate complex (Figure 4). The
C-14 methyl substituent (C-21) of the C-ring is in
van der Waals contact (<3.6 Å) with all six carbon
atoms of the side-chain of Phe146. CD2 and CE2 of
the same residue also interact with the C-12
methylene group of the C-ring, supporting the
earlier supposition that Phe146 is an important
determinant of fusidate binding. CD2 of Leu160
contacts the C-4 methyl substituent (C-18) on the
A-ring of fusidate and CD2 of Leu29 is involved in
a similar interaction with the C-15 methylene group
of the D-ring. In all, there are 57 contact distances
of <4 Å between atoms of fusidate and the protein
(Table 4) and a further nine to four ordered water
molecules in the binding site. Perhaps surprisingly
none of the remaining three substituted residues
(Cys92, Phe168 and Val172) make direct contact
Table 3. Refinement statistics for the CAT-fusidate complex

Diffraction data
Reflections (6.0 to 2.2 Å)          13,517 (100% complete)
R-value                             17.4%

Atomic model        No. of atoms    Mean isotropic thermal parameter (Å2)
Protein             1686            19.7
Fusidate            37              41.0
Solvent a           146             37.5

Stereochemical                RMS deviation        Refinement restraint
refinement parameter          from ideal values    weighting values
Bond distances (Å)            0.018                0.020
Bond angles (Å)               0.049                0.040
Planar 1-4 distances (Å)      0.059                0.050
Planes (Å)                    0.015                0.020
Chiral volumes (Å3)           0.166                0.150

a Includes 144 water atoms and two cobalt ions.



with the bound inhibitor. However the side-chains
of Cys92 and Phe168 almost certainly contribute to
the van der Waals binding energy via interactions
with C-26/C-27 of the hydrophobic tail and atoms
of the C-16 (O-acetyl) substituent, which are poorly
ordered in the crystal. It is also probable that these
substitutions are required to permit the appropriate
orientation of Phe146 and to provide a more
hydrophobic environment for binding of the apolar
ring systems. The observed abolition of fusidate
binding when the N146F substitution is made in
isolation most likely reflects the inappropriateness
of introducing a hydrophobic side-chain in the
context of the relatively hydrophilic environment
provided by Gln92 and Tyr168 of CATIII. In
wild-type CATIII the amide nitrogen of Gln92 is
involved in a hydrogen bond interaction with the
side-chain hydroxyl of Thr140 stabilising the
conformation of a surface loop (residues 137 to 142)
adjacent to the chloramphenicol binding site.
Elimination of this H-bond, via the Q92C substitution,
facilitates a major re-orientation of the loop
which would otherwise preclude binding of the
hydrophobic tail of fusidate (Figure 5). The
truncation of Tyr168 to phenylalanine is required to
avoid a steric clash with the new position of residue
146 and is accompanied by movement of the entire
side-chain of Phe168 towards residue 172. This in
turn necessitates the substitution of Ile172 by the
shorter valine side-chain.

The 3-α-hydroxyl group (O-6) of the A-ring is
involved in hydrogen bonds with both NE2 of
His195 (2.4 Å) and the phenolic-OH of Tyr25 (2.8 Å;
Figure 6). Both Tyr25 and His195 are also involved
in hydrogen bond interactions with chloramphenicol
when it is bound to wild-type CATIII (Leslie,
1990). As Tyr25 is replaced by phenylalanine in
CATI it follows that the role of His195 is the more
important in respect of both chloramphenicol and
fusidate binding. Indeed, when Tyr25 of CATIII is
replaced by phenylalanine it results in only minor
changes in the kinetic parameters of the acetyl
transfer reaction (Murray et al., 1991a). The
observation that fusidate analogues wherein the
3-α-hydroxyl is replaced by a β-OH (3-epifusidate)
or keto substituent (3-oxofusidate) bind weakly to
wild-type CATI (Bennett & Shaw, 1983) serves
to emphasise the importance of the hydrogen
bond with His195. Pre-steady state kinetic analyses
(Day et al., 1995) show that the rate of fusidate
dissociation from wild-type CATI increases when
His195 is replaced by alanine but that the equivalent
substitution in CATIII has no effect on inhibitor
dissociation, implying that the hydrogen bond
interaction is absent in wild-type CATIII. Indeed,
one explanation of the requirement for the concerted
substitution of four residues in the chloramphenicol
binding site of CATIII is that they are necessary to
permit access of the inhibitor to the base of the
substrate binding pocket to form the hydrogen bond
with His195.

NE2 of His195 acts as a general base to abstract
a proton from the primary (C-3) hydroxyl of
chloramphenicol as the initial step in the acetyl
transfer reaction. Although the 3-α-hydroxyl of
bound fusidate could in principle also donate its
proton to His195 it is ~1.6 Å removed from the
position occupied by the primary (C-3) hydroxyl of
chloramphenicol and in an entirely inappropriate
orientation to attack the thioester carbonyl of
acetyl-CoA and form a productive tetrahedral
intermediate after proton abstraction (Figure 7). In
addition, the C-2, C-3 and C-4 atoms of the A-ring
of fusidate partly overlap with the positions of
atoms of the tetrahedral intermediate (Lewendon
et al., 1990). This accounts for the inability of
fusidate binding CAT variants to acetylate the
steroidal inhibitor.

Using a sphere of radius 1.4 Å as probe we
calculated that 575 Å2 (or 81%) of the solvent-accessible
area of fusidate becomes buried on binding to
CAT. A similar proportion (89%, 1.7 Å probe radius)
of progesterone is buried in its complex with
the anti-progesterone monoclonal antibody DB3
Table 4. CAT-fusidate contacts less than 4 Å

Fusidate atom    CAT atom        Distance (Å)

C1               OH Tyr25        3.59
                 CG2 Thr94       3.96
C2               OH Tyr25        3.84
                 CG2 Thr94       3.50
                 CE1 Phe103      3.77
                 CZ Phe103       3.93
                 O Wat448        3.84
C3               OH Tyr25        3.82
                 NE2 His195      3.49
                 O Wat436        3.96
                 O Wat437        3.83
C4               O Wat437        3.66
C9               CD2 Phe146      3.91
C11              CD2 Phe146      3.89
C12              CE2 Phe135      3.78
                 CE2 Phe146      3.14
                 CD2 Phe146      3.37
C13              CE2 Phe146      3.77
C15              CD2 Leu29       3.07
C17              CE2 Phe146      3.89
                 CZ Phe146       3.98
C18              CD2 Leu160      3.17
                 NE2 His195      3.43
                 CD2 His195      3.48
                 O Wat437        3.87
C19              CB Phe146       3.89
C20              CE1 Phe24       3.85
                 OH Tyr25        3.47
                 CE1 Tyr25       3.68
                 CD2 Leu29       3.69
C21              CZ Phe146       3.08
                 CE2 Phe146      3.10
                 CE1 Phe146      3.29
                 CD2 Phe146      3.38
                 CD1 Phe146      3.53
                 CG Phe146       3.57
C22              CZ Phe146       3.86
                 CE2 Phe146      3.93
C23              CE2 Phe135      3.94
                 CE2 Phe146      3.67
                 CZ Phe146       3.90
                 O Wat314        3.66
C24              O Wat314        3.68
C26              SG Cys92        3.88
C27              SG Cys92        3.66
                 OG Ser107       3.69
C28              CD2 Phe135      3.04
                 CE2 Phe135      3.50
                 CG Phe135       3.82
                 O Wat314        3.41
C31              CE2 Phe168      3.78
                 CD2 Phe168      3.99
C32              CE2 Phe168      3.11
                 CD2 Phe168      3.28
O1               OH Tyr25        3.54
                 CE2 Phe135      3.86
                 [illegible]     3.92
O2               CZ Phe146       3.90
O6               OH Tyr25        2.79 a
                 CZ Tyr25        3.41
                 CE1 Tyr25       3.93
                 CZ Phe103       3.59
                 NE2 His195      2.42 a
                 CE1 His195      3.15
                 CD2 His195      3.33
                 O Wat436        3.73

a Hydrogen bond interactions.



(Arevalo et al., 1993), whereas dehydroandrosterone
is completely enclosed and sequestered from bulk
solvent when bound to cholesterol oxidase (Li et al.,
1993). However, in the latter case it is probable that
the eight-carbon hydrophobic tail of the substrate
cholesterol (which is replaced by a keto group in
dehydroandrosterone) projects out of the binding
pocket in a manner strictly analogous to that seen
in the CAT-fusidate complex. In each structure the
steroid binding site is primarily formed from the
side-chains of hydrophobic amino acid residues but
the precise spectrum of contacts is quite different in
each case. All three proteins utilise hydrogen bonds
to histidyl residues to stabilise the oxygen atom
(keto or hydroxyl) of the C-3 substituent of the
A-ring. In cholesterol oxidase His447, which is
hydrogen bonded to the 3-keto substituent via a
bridging water molecule, is implicated in each of
several postulated mechanisms for the oxidation
reaction (Li et al., 1993).

Tight binding of steroidal and other highly
apolar molecules to proteins requires not only a
high degree of surface and steric complementarity
between ligand and binding site but also a
mechanism to address the energetic cost of a
hydrophobic binding pocket which is accessible to
bulk solvent in the unliganded state. In the cases
of progesterone binding to the DB3 Fab' (Arevalo
et al., 1993), and of dehydroandrosterone to
cholesterol oxidase (Li et al., 1993), desolvation is
achieved by conformational changes in the protein
which accompany ligand binding. In the
unliganded state the binding site of DB3 exists in
a "closed" conformation, such that its hydrophobic
amino acids are shielded from bulk
solvent, which can convert to an open state to
permit access of the progesterone. In cholesterol
oxidase, the substrate binding site is completely
isolated from bulk solvent both in the absence
(when the site is occupied by ordered water
molecules) and in the presence of the steroid.
Thus, a model for steroid binding to cholesterol
oxidase requires that the ligand binding cavity
first open to permit binding then reclose around
the substrate, with the concomitant displacement
of the ordered water. In contrast to the above
examples, the binding of fusidate to CAT does not
appear to be accompanied by conformational
changes of the protein to facilitate its access to
the active site cleft at each subunit interface. This
implies that CATIII can tolerate the (presumably
destabilising) effect of exposing three "new"
hydrophobic residues to solvent in the unliganded
state. In this respect it is perhaps significant that
wild-type CATIII is an extremely robust enzyme,
being highly resistant to thermal denaturation
(Lewendon et al., 1988) and remaining folded and
trimeric in 8 M urea (P.J.D., unpublished experiments).
Although the stability of the Q92C/
N146F/Y168F/I172V variant has not been
studied, it should be noted that CATI, the
chloramphenicol binding site of which is even
more hydrophobic than that of the CATIII
quadruple mutant, is both less thermostable and
less soluble than wild-type CATIII (Day et al.,
1995).
Figure 5. Surface loop movement accompanying the Q92C substitution. The structures of wild-type and the
Q92C/N146F/Y168F/I172V variant of CATIII were aligned by superposition of the main-chain atoms of residues 10
to 130 and 150 to 210. Residues 92 and 135 to 146 of the wild-type (white) and substituted (green) variants are shown
in addition to the fusidate molecule bound to the latter. Note that the side-chains of residues 136-137 and 142-145 have
been deleted from the image for the purpose of clarity. Replacement of Gln92 by cysteine eliminates the hydrogen bond
to Thr140 permitting the movement of a surface loop (residues 137 to 142) which would otherwise preclude fusidate
binding. Although Val139 and Thr140 are disordered in the CAT-fusidate complex and therefore not shown, the loop
movement is revealed by a shift of several Å in the position occupied by Pro141.



Conclusions 

We have used site-directed mutagenesis and
X-ray crystallography to investigate the mechanism
by which a single enzyme active site can bind two
competing but chemically dissimilar ligands with
approximately equal avidity. The structure of
fusidate bound to the Q92C/N146F/Y168F/I172V
variant of CATIII reveals how a single protein can
confer resistance in vivo to two entirely different
classes of antimicrobial agent and is the third
example (after β-lactamase and CAT itself) of an
antibiotic resistance mechanism which is understood
at the atomic level. It has been suggested that
Figure 6. Intermolecular hydrogen bonds between bound ligand and the side-chains of Tyr25 and His195 occur in
both CAT-chloramphenicol and CAT-fusidate complexes.

Figure 7. Superposition of bound fusidate inhibitor and the structure of the complex of wild-type CATIII and the
tetrahedral oxyanion intermediate of the acetyl-transfer reaction. In the acetyl-transfer reaction catalysed by CAT,
abstraction (by NE2 of His195) of the 3-hydroxyl (O-15) proton of chloramphenicol facilitates nucleophilic attack at the
carbonyl carbon of acetyl-CoA leading to the formation of an oxyanion tetrahedral intermediate (O-) which is stabilised
by a hydrogen-bond interaction with the side-chain of Ser148 (Lewendon et al., 1990). While, in principle, proton
abstraction from the 3-α-hydroxyl (O-6) substituent of fusidate may also occur, the position and orientation of this
hydroxyl group (and steric hindrance due to overlap with the C-2 and C-3 carbon atoms of the A-ring) preclude
formation of a productive tetrahedral intermediate, accounting for the observed failure of CAT to acetylate the steroidal
inhibitor. For the purpose of clarity only the A, B and C rings of fusidate and those parts of CoA immediately proximal
to the tetrahedral intermediate (TI) are shown.



co-administration of chloramphenicol and fusidate
(or fusidate analogues) might offer a possible route
to overcome CAT-mediated chloramphenicol resistance
in clinical practice (Davies, 1994). Our
structural data suggest that such a strategy may not
be promising, since, of the diverse range of
naturally occurring CAT variants, it is only CATI
that carries the requisite motif of amino acid
residues in the chloramphenicol binding site to
generate a high affinity fusidate site. Conversely it
is apparent that mutations leading to the substitution
of amino acids within the chloramphenicol
binding site of a fusidate-binding CAT variant (c.f.
CATI) might be expected to result in loss of
inhibitor affinity without serious impairment of
catalytic competence in the acetyl transfer reaction
or consequent chloramphenicol resistance. It is not
apparent whether CATI-mediated fusidate resistance
is an evolved phenomenon or merely the result
of a serendipitous arrangement of side-chains at the
active site of this one variant. As only four of the 17
residues of the chloramphenicol binding site are
conserved among known CAT variants it has not
been possible to infer the evolutionary relationships
between members of the family. Nonetheless, in the
context of fusidate resistance, it is probably
significant that CATI is of enterobacterial origin
(including numerous genera that are relatively
insensitive to fusidate due to limited outer
membrane permeability) whereas naturally occurring
CAT variants isolated from Gram-positive
genera (commonly fusidate-sensitive) do not, to the
best of our knowledge, confer resistance.

The CAT-fusidate complex is the third example of
the determination of the structure of a protein
bound to a steroidal ligand and one wherein the
architecture of the binding site and the likely
mechanism of ligand binding are both quite distinct
from those observed in previously published
structures. Because the Q92C/N146F/Y168F/I172V
variant of CATIII is only slightly compromised in
the acetyl-transfer reaction (Table 2B) we believe
that the active site of the protein is not significantly
deformed in the absence of bound fusidate. This
implies that the hydrophobic residues of the
fusidate binding site are solvent-exposed in the
unliganded enzyme and that, in contrast to
cholesterol oxidase and the anti-progesterone Fab'
DB3, significant conformational changes are not
required to facilitate steroid binding. It is not known
whether the determinants and dynamics of ligand
binding to the effector recognition domains of
steroid receptors are analogous to those exemplified
by the structures of the CAT-fusidate complex
(binding at a preformed cleft), to that of DB3 and
progesterone (opening of a closed binding site), or to
the cholesterol oxidase-dehydroandrosterone complex
(gating and reclosure of a preformed cavity).
However, it is probable that the several discrete but
linked functions of receptors of the steroid/vitamin D/thyroid
hormone superfamily (Evans, 1988)
will involve additional structural responses to ligand
binding, favouring dimerisation and modulating the
specificity and affinity of receptor interaction with
DNA response elements.

Materials and Methods 

Construction of recombinant genes and 
site-directed mutagenesis 

Regions of the DNAs encoding CATI (Alton & Vapnek,
1979) and CATIII (Murray et al., 1988) were recombined
in vitro using the technique of "sticky-feet" directed
mutagenesis (Clackson & Winter, 1989). Single and
multiple point mutations of CATIII were introduced using
oligonucleotide primers and single-stranded M13 DNA
templates loaded with deoxyuridine by preparation in the
dut ung E. coli strain RZ1032 (Kunkel et al., 1987). The
presence of desired mutations and the absence of second
site changes were confirmed by DNA sequence determination
of both the complete coding and the 5' non-coding
regions of the genes.

Expression and purification of CAT 

Wild-type and mutant CAT proteins were expressed in
E. coli JM101 after subcloning the coding sequences in
pUC18. Enzymes were purified from cell-free extracts by
affinity chromatography using chloramphenicol-Sepharose
(Lewendon et al., 1988) or by ion-exchange
chromatography (DEAE-Sephacel) followed by affinity
chromatography using Cibacron-blue Sepharose (Murray
et al., 1991a). Homogeneity of purified enzymes was
confirmed by SDS-polyacrylamide gel electrophoresis and
enzyme concentrations were determined by the method
of Lowry et al. (1951).

Assay of CAT activity and kinetic determinations 

CAT activity was assayed spectrophotometrically at
25°C as described previously (Lewendon et al., 1990).
Standard assays contained 0.4 mM acetyl-CoA (prepared
from the lithium salt of CoA, Pharmacia, by the method
of Simon & Shemin, 1953), 0.1 mM chloramphenicol and
1.0 mM 5,5'-dithiobis(2-nitrobenzoic acid) in TSE buffer
(50 mM Tris-HCl (pH 7.5), 100 mM NaCl, 0.1 mM EDTA).
One unit is defined as the amount of enzyme required to
convert 1 µmol of chloramphenicol to 3-acetylchloramphenicol
in one minute using the standard assay.
Concentrations of acetyl-CoA and chloramphenicol were
varied in the standard assay for determination of steady
state kinetic parameters and all assays were carried out
in triplicate. Initial rate values were used to construct
double reciprocal plots and kinetic parameters were
derived from slope and intercept replots (Kleanthous &
Shaw, 1984). Ki values for competitive inhibition by
sodium fusidate were determined by varying the
concentrations of chloramphenicol and inhibitor in
standard assays containing a fixed concentration of
acetyl-CoA (routinely ~5 × Km), and were calculated
from linear slope replots derived from double reciprocal
plots. Crude extracts of chimaeric enzymes which
appeared to be inactive in the standard assay were
screened for low levels of CAT activity using a sensitive
radiometric assay (Gorman et al., 1982).
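
The slope-replot procedure for Ki can be illustrated with a short sketch. It is not the authors' analysis code. It assumes the textbook competitive-inhibition rate law, in which the slope of each double reciprocal plot equals (Km/Vmax)(1 + [I]/Ki), so that a replot of slope against [I] is linear and Ki is its intercept divided by its slope. The rate data below are synthetic and noise-free, generated from hypothetical parameter values, and all function names are illustrative.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def competitive_rate(s, i, vmax, km, ki):
    """Initial rate under simple competitive inhibition."""
    return vmax * s / (km * (1.0 + i / ki) + s)


def ki_from_slope_replot(substrate, inhibitor, rates):
    """Estimate Ki from double reciprocal slopes replotted against [I].

    rates[j][k] is the initial rate at inhibitor[j] and substrate[k].
    """
    reciprocal_s = [1.0 / s for s in substrate]
    primary_slopes = []
    for row in rates:
        slope, _ = fit_line(reciprocal_s, [1.0 / v for v in row])
        primary_slopes.append(slope)
    replot_slope, replot_intercept = fit_line(inhibitor, primary_slopes)
    return replot_intercept / replot_slope


# Synthetic data from hypothetical parameters (concentrations in µM).
substrate = [5.0, 10.0, 20.0, 40.0, 80.0]
inhibitor = [0.0, 2.5, 5.0, 10.0]
rates = [[competitive_rate(s, i, vmax=1.0, km=20.0, ki=5.4) for s in substrate]
         for i in inhibitor]
ki_estimate = ki_from_slope_replot(substrate, inhibitor, rates)
print(round(ki_estimate, 3))
```

With real triplicate measurements the primary plots would carry experimental error, and direct nonlinear fitting of the rate equation is a common alternative to reciprocal replots.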

Crystallisation and structure determination 

Single crystals of the Q92C/N146F/Y168F/I172V
variant of CATIII were prepared by microdialysis using
small "Lucite" buttons as described previously
(Leslie, 1990). Each button contained 25 µl of protein
(-SJmgml- 1 ) in 10 mM MES buffer (pH 6.3), or the 
same buffer supplemented with 0.5 mM sodium fusidate,
and were dialysed at 4°C against 10 ml of 10 mM MES
(pH 6.3) containing 2 to 4% (v/v) 2-methyl-2,4-pentanediol,
1 mM hexamine cobalt (III) chloride, 0.1 mM
dithiothreitol and either 1 mM chloramphenicol or
0.5 mM sodium fusidate. Crystals were isomorphous
with those of the wild-type enzyme; space group
R32 (equivalent hexagonal cell dimensions a = 107.8 Å,
c = 124.1 Å). 80° of data were collected to 2.2 Å resolution
from a single crystal (dimensions 260 µm ×
240 µm × 120 µm) grown in the presence of fusidate using
CuKα radiation from a GX13 rotating anode generator
with double mirror collimation. Data were recorded on a
prototype Hendrix-Lentfer image plate scanner with a
diameter of 18 cm. The images were integrated with
MOSFLM (Leslie, 1992) and programs from the CCP4
Suite (1994). A total of 68,511 observations were reduced
to a unique dataset of 14,238 reflections with a
crystallographic merging R-factor of 9.5% (24.9% in the
highest resolution range). The dataset is 99% complete out
to 2.2 Å resolution, with an overall I/σ(I) ratio of 18.3 (7.6
at 2.2 Å resolution).
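
A few derived quantities help in reading these data collection statistics. The sketch below computes the average multiplicity of the dataset, the Bragg scattering angle at the resolution limit, and the volume of the hexagonal cell. It is a reader's aid, not part of the original analysis: the Cu Kα wavelength of 1.5418 Å is the standard tabulated mean and is not stated in the text, and the function names are hypothetical.

```python
import math

# Assumption: mean Cu K-alpha wavelength in angstroms (standard value,
# not given in the text).
CU_KALPHA_ANGSTROM = 1.5418


def multiplicity(n_observations, n_unique):
    """Average number of measurements per unique reflection."""
    return n_observations / n_unique


def bragg_two_theta_deg(d_spacing, wavelength=CU_KALPHA_ANGSTROM):
    """Scattering angle 2-theta (degrees) from Bragg's law, lambda = 2 d sin(theta)."""
    return 2.0 * math.degrees(math.asin(wavelength / (2.0 * d_spacing)))


def hexagonal_cell_volume(a, c):
    """Volume of a hexagonal cell (gamma = 120 degrees), in cubic angstroms."""
    return a * a * c * math.sin(math.radians(120.0))


mult = multiplicity(68511, 14238)            # observations / unique reflections
two_theta = bragg_two_theta_deg(2.2)         # angle at the 2.2 angstrom limit
volume = hexagonal_cell_volume(107.8, 124.1)  # hexagonal setting of R32

print(round(mult, 2), round(two_theta, 1), round(volume))
```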

The refined structure of the CATIII-chloramphenicol
binary complex (Leslie, 1990) was used as a starting
model for refinement. The chloramphenicol and all
water molecules in the chloramphenicol binding pocket
were removed from the model and the substitutions
Q92C, N146F, Y168F and I172V made. This model was
subjected to alternating rounds of refinement using the
CCP4 programs SFALL, PROTIN and PROLSQ and
manual rebuilding using the interactive graphics program
O (Jones et al., 1991). After one round of refinement
there was very clear density compatible with a model of
fusidate derived from the crystal structure of fusidic acid
methyl ester p-bromobenzoate (Cooper & Hodgkin, 1968)
and this was included in the model for all subsequent
refinement. After four rounds of model building and
refinement the final R-factor was 17.4% for all reflections
in the resolution range 6.0 to 2.2 Å. Computer graphics
images were produced using the conic option (Huang
et al., 1991) of the MidasPlus program (Ferrin et al., 1988).

A Complex Mutant of TEM-1 β-Lactamase with Mutations
Encountered in Both IRT-4 and Extended-Spectrum TEM-15,
Produced by an Escherichia coli Clinical Isolate

Escherichia coli GR102 was isolated from feces of a leukemic patient. It expressed different levels of
resistance to amoxicillin or ticarcillin plus clavulanate and to the various cephalosporins tested. The double-disk
synergy test was weakly positive. Production of a β-lactamase with a pI of 5.6 was transferred to E. coli
HB101 by conjugation. The nucleotide sequence was determined by direct sequencing of the amplification
products obtained by PCR performed with TEM gene primers. This enzyme differed from TEM-1 (blaT-1B
gene) by four amino acid substitutions: Met→Leu-69, Glu→Lys-104, Gly→Ser-238, and Asn→Asp-276. The
amino acid substitutions Leu-69 and Asp-276 are known to be responsible for inhibitor resistance of the IRT-4
mutant, as are Lys-104 and Ser-238 substitutions for hydrolytic activity of the extended-spectrum β-lactamases
TEM-15, TEM-4, and TEM-3. These combined mutations led to a mutant enzyme which conferred a level of
resistance to coamoxiclav (MIC, 64 µg/ml) much lower than that conferred by IRT-4 (MIC, 2,048 µg/ml) but
higher than that conferred by TEM-15 or TEM-1 (MIC, 16 µg/ml). In addition, the MIC of ceftazidime for E.
coli transconjugant GR202 (1 µg/ml) was lower than that for E. coli TEM-15 (16 µg/ml) and higher than that
for E. coli IRT-4 or TEM-1 (0.06 µg/ml). The MICs observed for this TEM-type enzyme were related to the
kinetic constants Km and kcat, and the 50% inhibitory concentration, which were intermediate between those
observed for IRT-4 and TEM-15. In conclusion, this new type of complex mutant derived from TEM-1 (CMT-1)
is able to confer resistance at a very low level to inhibitors and at a low level to extended-spectrum
cephalosporins. CMT-1 received the designation TEM-50.



Overproduction of Escherichia coli chromosomal β-lactamase
is one cause of resistance to β-lactam-β-lactamase inhibitor
combinations such as amoxicillin (AMX)-clavulanate
(CA) and also results in reduced susceptibility to all β-lactams
except carbapenems (15).

In E. coli, resistance to all β-lactams except cephamycins and
carbapenems may be caused by extended-spectrum β-lactamases.
These enzymes are susceptible to β-lactamase inhibitors
such as CA (10, 14, 15) and are therefore detected by
synergy tests (10), and strains producing such mutants are
often susceptible to β-lactam-β-lactamase inhibitor combinations.

In E. coli isolates, the most recently discovered mechanism
of resistance to AMX-CA is production of inhibitor-resistant
TEM β-lactamases (IRT) (8).

E. coli GR102, isolated from feces of a leukemic patient in
the hematology unit of the teaching hospital of Grenoble,
France, harbored an unusual β-lactam resistance phenotype
with resistance to AMX and ticarcillin (TIC) alone and combined
with CA and resistance to all cephalosporins, including
cephamycins, at various levels. In addition, the double-disk
synergy test used for extended-spectrum β-lactamase detection
was weakly positive.

This complex phenotype suggested that the β-lactam resistance
of the strain was due to the presence of several β-lactamases
or a combination of different mechanisms of resistance
to β-lactams.

MATERIALS AND METHODS 

Strains. The strains used included E. coli GR102, a clinical isolate producing
a novel β-lactamase; E. coli HB101, used as a recipient strain for transfer; E. coli
HB101/plU (TEM-1 producing); E. coli CF0042 (IRT-4-TEM-35 producing)
(8); and E. coli transformant DH5α (CF244), obtained by electroporation from
Klebsiella pneumoniae Kp240 (TEM-15 producing) (16).

Susceptibility to β-lactams. The MICs of AMX, TIC, cephalothin (CF), cefotaxime
(CTX), ceftazidime (CAZ), aztreonam (ATM), cefepime (FEP), and
cefpirome (CPO) alone and combined with CA at a fixed concentration of 2
µg/ml were determined. A method of dilution with Mueller-Hinton agar (Sanofi
Diagnostics Pasteur, Marnes-la-Coquette, France) and an inoculum of 10^4 CFU
per spot were used. Antibiotics were provided as powders by SmithKline
Beecham Pharmaceuticals (AMX, TIC, and CA), Roussel-Uclaf (CTX and
CPO), Glaxo Wellcome Research and Development (CAZ), and Bristol-Myers-Squibb
(ATM and FEP).

Detection of extended-spectrum β-lactamase was performed with the double-disk
synergy test as described by Jarlier et al. (10).

Isoelectric focusing. Isoelectric focusing was performed with polyacrylamide
gels containing ampholines with a pH range of 3.5 to 10.0 as previously described
(19), and β-lactamases with known pIs (TEM-1 [pI 5.4], TEM-2 [pI 5.6], TEM-15
[pI 6.0], and IRT-4 [pI 5.2]) were used as standards.

Transfer experiment. A transfer experiment was performed with E. coli
GR102 and the recipient E. coli HB101. Transconjugants were selected on agar
containing rifampin (300 µg/ml) and gentamicin (8 µg/ml) or CAZ (0.5 µg/ml).

Sequencing of DNA amplified by PCR. On the assumption that the transconjugant
strain contained blaTEM, a single-stranded DNA template was generated
for sequencing by PCR performed with an asymmetric ratio of amplification
primers A and B, and the nucleotide sequence was determined as previously
described (3), by direct sequencing of the amplified product obtained from the
transconjugant E. coli GR202.

Determination of β-lactamase kinetic parameters kcat, Km, and kcat/Km. Affinity
(Km) and catalytic activity (kcat) were determined with highly purified
extracts (≥97% pure) by using a computerized microacidimetric method (13).

TABLE 1. Nucleotide and amino acid substitutions in blaTEM genes

Nucleotide      Nucleotide (amino acid)^b in:
position^a      blaTEM-1 (Tn2)    blaTEM-15    blaIRT-4     blaCMT-1

 226            T (Phe)           C            T            T
 317            C (Gln-39)        C            C            C
 346            A (Glu)           A            G            A
 407            A (Met-69)        A            C (Leu)      C (Leu)
 436            T (Gly)           C            T            T
 512            G (Glu-104)       A (Lys)      G            A (Lys)
 604            T (Ala)           G            G            T
 682            T (Thr)           T            T            T
 914            G (Gly-238)       A (Ser)      G            A (Ser)
 925            G (Gly)           G            G            G
1022            A (Asn-276)       A            G (Asp)      G (Asp)

^a Nucleotide numbering is according to Sutcliffe (21).
^b The amino acid is indicated when a point mutation leads to an amino acid
substitution compared with the sequences of TEM-1 (Tn2). Numbering is according
to Ambler et al. (1).



All β-lactamases were purified from crude extracts by size exclusion chromatography
on Sephadex G-100 (Pharmacia), preparative isoelectric focusing, and
reverse-phase high-performance liquid chromatography on a C18 Nucleosil 500A
column (Interchim) as described by Brun et al. (2). The homogeneity of the
preparations was determined by analytical sodium dodecyl sulfate-polyacrylamide
gel electrophoresis. The kcat, Km, and kcat/Km values of enzyme CMT-1 were
compared with those of the β-lactamases TEM-1, TEM-15, and IRT-4. The
kinetics of TEM-1 and its mutants toward penicillins and cephalosporins were
compared. Inhibition studies of TEM-1 and the mutant enzymes with CA, sulbactam,
and tazobactam were performed. The affinity of the enzyme for the
inhibitor, expressed as the inhibition constant (Ki), was measured by using competition
procedures with benzylpenicillin. It is determined from the extrapolated
rate at the time when the inhibitor is added. The 50% inhibitory concentration
(IC50) was determined after incubation of the inhibitor and the enzyme for 10
min (completed inactivation) at 37°C before measurement of the remaining
enzymatic activity. The IC50 is defined as the inhibitor concentration causing
50% inhibition of benzylpenicillin hydrolysis by the enzyme.

RESULTS 

Resistance phenotype of E. coli GR102. E. coli GR102 expressed
a complex β-lactam resistance phenotype with resistance
to AMX and TIC alone and combined with CA and
resistance to cephalosporins at various levels: high-level resistance
to narrow-spectrum cephalosporins and low-level resistance
to extended-spectrum cephalosporins (MICs, 1 to 32
µg/ml). A positive synergy test with CA suggested the presence
of a mutant extended-spectrum β-lactamase of class A origin.
In addition, this strain had reduced susceptibility to cefoxitin
(MIC, 128 µg/ml) and, to a lesser extent, cefotetan and moxalactam
(data not shown). This reduced susceptibility to cephamycins
was probably related to decreased permeability of the
strain for β-lactams, since E. coli GR102 had lost an outer
membrane protein with a molecular mass of 40 kDa (data not
shown). Overproduction of the chromosomal cephalosporinase
was not detected.

This strain was also resistant to aminoglycosides (tobramycin,
gentamicin, and netilmicin), probably owing to production
of an AAC(3)-II enzyme, and to chloramphenicol, tetracyclines,
and sulfonamides.

Conjugative transfer and isoelectric focusing. The gene encoding
resistance to β-lactams, except cephamycins, was transferred
by conjugation from E. coli GR102 to rifampin-resistant
E. coli HB101 (GR202). Selection for CAZ or gentamicin resistance
revealed the transfer of an 85-kb plasmid conferring
resistance to β-lactams, aminoglycosides (tobramycin, gentamicin,
and netilmicin), tetracyclines, and sulfonamides.

By isoelectric focusing, two bands at pIs 5.4 and 5.6 were
observed in E. coli GR102 and one band at pI 5.6 was observed
in E. coli transconjugant GR202.

Nucleotide sequencing. As shown in Table 1, from the transconjugant
GR202 producing a β-lactamase with a pI of 5.6,
nucleotide sequencing revealed a blaTEM gene identical to the
blaT-1B gene (Tn2) at positions 226, 317, 346, 436, 604, 682,
and 925, which discriminate the blaTEM genes (4, 7).

The blaTEM gene from the E. coli transconjugant differed
from the blaT-1B gene by four point mutations. These mutations
consisted of the nucleotide change A→C at position 407,
which leads to the amino acid substitution Met→Leu at position
69 (1); the nucleotide change G→A at positions 512 and
914, leading to the amino acid substitutions Glu→Lys at position
104 and Gly→Ser at position 238; and the nucleotide
change A→G at position 1022, leading to the amino acid
substitution Asn→Asp at position 276. The two amino acid
substitutions at positions 69 and 276 are observed in the IRT-4-TEM-35
enzyme (2, 8, 22), and the two amino acid substitutions
at positions 104 and 238 are observed in the extended-spectrum
β-lactamase TEM-15 (16).
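
The four point mutations described above can be cross-checked against the standard genetic code. The sketch below is illustrative only: the actual codons of the blaTEM gene are not given in this text, so the helper (a hypothetical name) enumerates every wild-type codon for which the reported single-base change would produce the reported amino acid substitution.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def consistent_codon_changes(wild_aa, mutant_aa, old_base, new_base):
    """List (wild codon, mutant codon) pairs in which a single
    old_base -> new_base change converts wild_aa into mutant_aa."""
    hits = []
    for codon, aa in CODON_TABLE.items():
        if aa != wild_aa:
            continue
        for i, base in enumerate(codon):
            if base != old_base:
                continue
            mutated = codon[:i] + new_base + codon[i + 1:]
            if CODON_TABLE[mutated] == mutant_aa:
                hits.append((codon, mutated))
    return hits


# Substitutions reported in the text (one-letter amino acid codes).
reported = {
    "Met->Leu-69 (A->C at 407)": ("M", "L", "A", "C"),
    "Glu->Lys-104 (G->A at 512)": ("E", "K", "G", "A"),
    "Gly->Ser-238 (G->A at 914)": ("G", "S", "G", "A"),
    "Asn->Asp-276 (A->G at 1022)": ("N", "D", "A", "G"),
}
for label, args in reported.items():
    print(label, consistent_codon_changes(*args))
```

Every reported substitution has at least one codon pair consistent with the stated nucleotide change, and in each case the change falls at the first codon position.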

β-Lactam MICs for TEM mutants. Consequently, we compared
the MICs of β-lactams for E. coli GR102 and its transconjugant
GR202, producing a complex mutant form of TEM
(CMT-1), with MICs for IRT-4-producing E. coli CF0042,
TEM-15-producing E. coli CF244, and TEM-1-producing E.
coli HB101 (Table 2).

For E. coli GR202 producing CMT-1, the MICs of AMX-CA
(64 µg/ml) and TIC-CA (64 µg/ml) were much lower than



TABLE 2. MICs of β-lactams for E. coli CMT-1 (GR102 and its transconjugant, GR202), E. coli TEM-15 (CF244),
E. coli IRT-4 (CF0042), and E. coli TEM-1 (HB101)

                    MIC (µg/ml) of drug alone / with CA^a:
E. coli strain      AMX              TIC              CF           CTX              CAZ              ATM              FEP              CPO

GR102 (CMT-1)       4,096 / 256      >4,096 / 512     256 / 128    4 / 0.5          4 / 1            1 / 0.25         8 / 1            32 / 4
GR202^b (CMT-1)     2,048 / 64       4,096 / 64       8 / 4        1 / 0.03         1 / 0.12         0.12 / 0.06      1 / 0.03         2 / 0.12
CF244^c (TEM-15)    >4,096 / 16      >4,096 / 32      128 / 4      8 / 0.06         16 / 0.25        4 / 0.12         1 / 0.03         2 / 0.03
CF0042 (IRT-4)      4,096 / 2,048    1,024 / 512      8 / 4        ≤0.06 / ≤0.06    ≤0.06 / ≤0.06    ≤0.06 / ≤0.06    ≤0.06 / ≤0.06    ≤0.06 / ≤0.06
HB101 (TEM-1)       4,096 / 16       4,096 / 32       8 / 2        ≤0.06 / ≤0.06    ≤0.06 / ≤0.06    ≤0.06 / ≤0.06    ≤0.06 / ≤0.06    ≤0.06 / ≤0.06
HB101^d             4 / 4            1 / 1            4 / 4        ≤0.06 / ≤0.06    ≤0.06 / ≤0.06    ≤0.06 / ≤0.06    ≤0.06 / ≤0.06    ≤0.06 / ≤0.06

^a CA was used at 2 µg/ml.
^b E. coli HB101 transconjugant.
^c E. coli DH5α transformant.
^d E. coli recipient strain.

TABLE 3. Production of TEM-type β-lactamases in E. coli

Enzyme           pI     Producing    Activity in       Sp act^b of
                        organism     crude extract^a   purified protein

TEM-1            5.4    HB101        2.2               2.48
TEM-35-IRT-4     5.2    CF0042       2.1               2.17
TEM-15           6.0    CF244        0.4               0.083
TEM-50-CMT-1     5.6    GR202        0.4               0.23

^a Micromoles of benzylpenicillin per minute per milligram of protein.
^b Micromoles of benzylpenicillin per minute per microgram of protein. Determined
with highly purified preparations (≥97% pure).



those observed for IRT-4-producing strain CF0042 (2,048 and
512 µg/ml, respectively). Similarly, for strain GR202, MICs of
CTX (1 µg/ml) and CAZ (1 µg/ml) were lower than those
observed for TEM-15-producing strain CF244 (8 and 16 µg/ml,
respectively) and higher than those for IRT-4-producing strain
CF0042 (<0.06 µg/ml). The same 1:8 ratio of MICs was observed
for aztreonam (0.12 µg/ml for the CMT-1 producer and
4 µg/ml for the TEM-15 producer). The MICs of cefepime and
cefpirome (1 and 2 µg/ml) were identical for the CMT-1 and
TEM-15 producers.

E. coli GR202 (CMT-1) was 2 to 4 times less susceptible to
AMX or TIC plus CA (64 µg/ml) and 16 times less susceptible
to CTX, CAZ, FEP, and CPO (1 to 2 µg/ml) than was E. coli
HB101 (TEM-1). MICs of β-lactam substrates in the presence
of 2 and 4 µg of sulbactam or tazobactam per ml were
about fourfold lower for the CMT-1-producing strain than for
the TEM-1-producing strain (data not shown).
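
The fold differences quoted in this section follow directly from the MICs in Table 2. The sketch below recomputes a few of them and also expresses each ratio as a number of twofold dilution steps. The dictionary layout and function names are illustrative assumptions, and censored values (≤0.06 or >4,096 µg/ml) are deliberately left out because a ratio involving them would only be a bound.

```python
import math

# MICs in µg/ml taken from Table 2 (uncensored values only).
MIC = {
    "GR202 (CMT-1)": {"AMX-CA": 64, "TIC-CA": 64, "CTX": 1, "CAZ": 1},
    "CF244 (TEM-15)": {"AMX-CA": 16, "TIC-CA": 32, "CTX": 8, "CAZ": 16},
    "CF0042 (IRT-4)": {"AMX-CA": 2048, "TIC-CA": 512},
    "HB101 (TEM-1)": {"AMX-CA": 16, "TIC-CA": 32},
}


def fold_difference(strain_a, strain_b, drug):
    """Ratio MIC(strain_a) / MIC(strain_b) for one drug."""
    return MIC[strain_a][drug] / MIC[strain_b][drug]


def dilution_steps(strain_a, strain_b, drug):
    """The same ratio expressed as a number of twofold dilution steps."""
    return round(math.log2(fold_difference(strain_a, strain_b, drug)))


for a, b, drug in [
    ("CF0042 (IRT-4)", "GR202 (CMT-1)", "AMX-CA"),
    ("GR202 (CMT-1)", "HB101 (TEM-1)", "AMX-CA"),
    ("GR202 (CMT-1)", "HB101 (TEM-1)", "TIC-CA"),
    ("CF244 (TEM-15)", "GR202 (CMT-1)", "CAZ"),
]:
    print(f"{drug}: {a} vs {b}: "
          f"{fold_difference(a, b, drug):g}-fold, "
          f"{dilution_steps(a, b, drug)} dilution steps")
```

Because MICs are read on a twofold dilution series, a difference of a single step is generally within the reproducibility of the method, so only multi-step differences should be read as meaningful.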

Enzymatic and kinetic parameters of β-lactamases. Enzymatic
and kinetic parameters of the new complex mutant enzyme
CMT-1 with regard to penicillins and cephalosporins
were compared with those of the TEM-1, IRT-4, and TEM-15
β-lactamases (Tables 3 to 5). The specific activity of the highly
purified CMT-1 protein was 10-fold lower than that of TEM-1
(Table 3).

For all penicillins, the kcat values of CMT-1 were about
10-fold lower than those of TEM-1 and IRT-4 and about twice
as high as those of TEM-15. The catalytic efficiencies (kcat/Km)
of the three mutant enzymes were lower than those of the


TABLE 4. Comparison of the kinetics^a of TEM-1
and its mutant forms

                   kcat (s^-1), Km (µM), kcat/Km (µM^-1 s^-1)
Drug               TEM-1               TEM-35-IRT-4        TEM-15           TEM-50-CMT-1

Benzylpenicillin   1,200, 25, 48.0     1,050, 140, 7.5     40, 6, 6.7       110, 17, 6.5
AMX                920, 26, 35.4       900, 245, 8.5       26, 5, 5.2       70, 33, 2.1
TIC                115, 10, 11.5       125, 320, 0.4       8, 2, 4.0        15, 30, 0.5
Carbenicillin      132, 13, 10.2       120, 360, 0.3       7, 3, 2.3        13, 60, 0.2
Piperacillin       987, 45, 21.9       945, 320, 2.9       64, 12, 5.3      111, 31, 3.6
CF                 122, 250, 0.5       52, 1,200, 0.04     43, 23, 1.7      62, 324, 0.2
Cephaloridine      2,045, 800, 2.5     340, 1,420, 0.2     37, 30, 1.2      320, 310, 1.03
Cefoperazone       470, 260, 1.8       305, 1,325, 0.2     25, 22, 1.1      150, 118, 1.3
Cefuroxime         ND^b                ND                  24, 91, 0.3      20, 260, 0.08
Ceftriaxone        ND                  ND                  92, 50, 1.8      35, 385, 0.09
CTX                1.2, ND             ND                  180, 100, 1.8    150, 873, 0.2
CAZ                ND                  ND                  7, 80, 0.1       3, ND
ATM                ND                  ND                  0.2, ND          1, ND

^a The standard deviation for analysis was ≤10%.
^b ND, not detected; the rate was too small to determine kcat and Km reliably.

TABLE 5. Inhibition of TEM-1 and its mutant forms by CA,
sulbactam, and tazobactam

                 IC50 (µM), Ki (µM)
Enzyme           CA            Sulbactam     Tazobactam

TEM-1            0.08, 0.1     6.1, 0.9      0.1, 0.01
TEM-35-IRT-4     28, 27        304, 49       1.8, 0.6
TEM-15           0.01, 0.02    0.03, 0.02    0.01, 0.008
TEM-50-CMT-1     0.25, 0.7     0.5, 0.4      0.04, 0.06



TEM-1 enzyme, and the values observed for CMT-1 and IRT-4
with carboxy- and ureidopenicillins were similar.

For cephalosporins, kcat values of CMT-1 were slightly lower
than or similar to those of TEM-15 for ceftriaxone, CTX,
CAZ, and ATM; however, the catalytic efficiencies of the
CMT-1 enzyme were only 5 to 11% of those of TEM-15 with
ceftriaxone and CTX. No activity of TEM-1 or IRT-4 against
expanded-spectrum cephalosporins and ATM (kcat values of <1 s^-1
associated with Km values of ≥500 µM) was detected.
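
The comparison of catalytic efficiencies can be reproduced from the kcat and Km values in Table 4. The following is a minimal sketch under stated assumptions: the table is badly garbled in this copy, so the TEM-15 values for CTX (kcat 180 s^-1, Km 100 µM) are a reading of the damaged row rather than a verified transcription, and the function names are illustrative.

```python
# (kcat in s^-1, Km in µM) read from Table 4. The TEM-15/CTX entry is
# reconstructed from a damaged table row and should be treated as an assumption.
KINETICS = {
    ("TEM-15", "ceftriaxone"): (92.0, 50.0),
    ("TEM-50-CMT-1", "ceftriaxone"): (35.0, 385.0),
    ("TEM-15", "CTX"): (180.0, 100.0),
    ("TEM-50-CMT-1", "CTX"): (150.0, 873.0),
}


def catalytic_efficiency(kcat, km):
    """kcat/Km in µM^-1 s^-1."""
    if km <= 0:
        raise ValueError("Km must be positive")
    return kcat / km


def relative_efficiency(enzyme, reference, drug):
    """Efficiency of `enzyme` as a fraction of that of `reference`."""
    eff = catalytic_efficiency(*KINETICS[(enzyme, drug)])
    ref = catalytic_efficiency(*KINETICS[(reference, drug)])
    return eff / ref


for drug in ("ceftriaxone", "CTX"):
    frac = relative_efficiency("TEM-50-CMT-1", "TEM-15", drug)
    print(f"{drug}: CMT-1 efficiency is {100 * frac:.1f}% of TEM-15")
```

Computed from the unrounded kcat and Km values, the fractions come out at roughly 5% and 10%, in line with the 5 to 11% range quoted in the text, which appears to be based on the rounded kcat/Km entries.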

The IC50 of CA for CMT-1 (Table 5) was higher (0.25 µM)
than that for TEM-1 (0.08 µM) and TEM-15 (0.01 µM) but
100-fold lower than that for IRT-4 (28 µM). Sulbactam was the
least efficient inhibitor of IRT-4 (IC50, 304 µM), while its
inhibitor efficiency was similar to that of CA for CMT-1 (IC50,
0.5 µM). Tazobactam was the most efficient inhibitor of all of
these β-lactamases. Moreover, CMT-1 (IC50, 0.04 µM) and
TEM-15 (IC50, 0.01 µM) were more susceptible to inhibition
by tazobactam than was TEM-1 (IC50, 0.1 µM).
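
The relative inhibitor resistance of each enzyme can be expressed as the ratio of its IC50 to that of TEM-1. The sketch below uses the IC50 values from Table 5; the data structure and function name are illustrative, not part of the original analysis.

```python
# IC50 values in µM from Table 5.
IC50 = {
    "TEM-1": {"CA": 0.08, "sulbactam": 6.1, "tazobactam": 0.1},
    "TEM-35-IRT-4": {"CA": 28.0, "sulbactam": 304.0, "tazobactam": 1.8},
    "TEM-15": {"CA": 0.01, "sulbactam": 0.03, "tazobactam": 0.01},
    "TEM-50-CMT-1": {"CA": 0.25, "sulbactam": 0.5, "tazobactam": 0.04},
}


def fold_resistance(enzyme, inhibitor, reference="TEM-1"):
    """IC50 of `enzyme` divided by IC50 of `reference` for one inhibitor.
    Values above 1 mean the enzyme is harder to inhibit than the reference."""
    return IC50[enzyme][inhibitor] / IC50[reference][inhibitor]


for enzyme in ("TEM-35-IRT-4", "TEM-15", "TEM-50-CMT-1"):
    ratios = ", ".join(
        f"{inh}: {fold_resistance(enzyme, inh):.3g}x"
        for inh in ("CA", "sulbactam", "tazobactam")
    )
    print(f"{enzyme} vs TEM-1 -> {ratios}")
```

For CA, the ratios are about 350 for IRT-4 and about 3 for CMT-1, while the IRT-4 to CMT-1 ratio (28/0.25) is a little over 100.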

DISCUSSION 

The TEM-1 derivative described in this report constitutes a
new type of complex mutant, CMT-1, combining mutations
responsible for inhibitor resistance (Leu-69 and Asp-276) and
those responsible for extended-spectrum activity (Lys-104 and
Ser-238). It is the first example of such a β-lactamase produced
by a clinical isolate of E. coli.

Mutations conferring resistance to β-lactam inhibitors. Replacement
of methionine 69, just adjacent to serine 70, by
aliphatic amino acids such as leucine influences the positioning
of residues (5, 17) because the buried side chain at position 69
lies behind β-strand B3, forming the back wall of the oxyanion
pocket in which the β-lactam's carbonyl group is polarized
(12). Moreover, crystallographic data indicate that residues in
the C-terminal α helix, such as Asn-276, restrict the mobility of
the Arg-244 side chain and so play a role in maintaining the
integrity of the active site (11). Because small β-lactams such
as CA must rely primarily on attractive interactions with the
oxyanion hole and Arg-244, inhibitor resistance exists in the
natural variant IRT-4, containing changes at residues 69 and
276 (12). This IRT-4 mutant enzyme is one of the most resistant
to inhibition by CA among the IRT-type enzymes (2, 22),
with a CA IC50 350-fold higher than that for TEM-1. The CA
resistance of this mutant is confirmed by high-level resistance
to combinations of AMX and TIC with CA (MIC, 2,048 and
512 µg/ml, respectively). Inhibition studies showed that the
CMT-1 enzyme was 100-fold less resistant than the IRT-4
mutant but only 3 times as resistant to inhibition by CA as the
wild-type TEM-1 β-lactamase. These kinetic results were closely
related to the moderate resistance level (64 µg/ml) of the
CMT-1-producing strain to AMX-CA or TIC-CA. The inhibitor
resistance usually caused by the Leu-69 mutation may be
decreased by the close proximity of the Ser-238 mutation (see
below).

Mutations conferring extended-spectrum activity. The Glu→Lys
change at position 104 contributes to the precise positioning
of residues 130 to 132 (SDN loop), which are involved in
substrate binding, but seems insufficient alone to confer true
resistance to expanded-spectrum cephalosporins (18, 20).

It is generally recognized that the substitution Gly→Ser-238
enlarges the active site, thereby creating an enzyme with increased
affinity for the 7-oxyimino cephalosporins (6). All of
the TEM variants reported to have Ser-238 contain methionine
at position 69, and mutant enzyme CMT-1 is the first harboring
both mutations Ser-238 and Leu-69. The affinity may be affected
by a change at position 69, since the side chain at
position 238, on the inner side of the B3 β-strand, lies very
close to the side chain of residue 69 (12).

The TEM-1 variant, with the associated changes Glu→Lys-104
and Gly→Ser-238, is TEM-15 (16). Complex mutant CMT-1,
with these last mutations, conferred a lower level of resistance
to CTX and CAZ than did TEM-15, and this difference correlated
with the kinetic constants. The kinetic comparison of
substrate hydrolysis in extended-spectrum β-lactamases TEM-15
and CMT-1 revealed that the catalytic efficiency (kcat/Km) of
CMT-1 was lower than that of TEM-15 for CTX (10-fold) and
ceftriaxone (20-fold).

Overall, the hydrolytic properties of this complex TEM mutant
enzyme were found to be closer to those of an extended-spectrum
enzyme than to those of an inhibitor-resistant enzyme.
However, the predominant effect of the mutations Lys-104
and Ser-238, which are responsible for extended-spectrum
activity and inhibitor hypersusceptibility, was clearly attenuated
by the mutations Leu-69 and Asp-276. In the strain producing
CMT-1, complete reversal of CA resistance by mutations
enhancing activity against 7-oxyimino cephalosporins was
not observed as reported in a Ser-164-Ser-244 mutant obtained
by site-specific mutagenesis (9).

E. coli GR102 was isolated from a patient treated with CPO
(4 g/day) and amikacin (1 g/day) for 12 days. An E. coli strain
with a typical β-lactam inhibitor resistance phenotype and the
same resistances to other antibiotics had been isolated from
the same sample (feces) from this patient a week before. This
suggests that an inhibitor-resistant TEM mutant enzyme with
the Leu-69 and Asp-276 mutations emerged first, and then,
under antibiotic (CPO) and mutagenic agent (cytarabine and
daunorubicin) pressure, this mutant underwent the two additional
mutations, Lys-104 and Ser-238, responsible for extended-spectrum
activity. Unfortunately, this hypothesis could not
be confirmed since the initial E. coli IRT-producing strain was
not kept.

In conclusion, the production of this complex TEM mutant
cannot alone account for the high-level multiresistance to
β-lactams of the E. coli GR102 isolate, in which several resistance
mechanisms were involved (TEM-1 and, probably, decreased
permeability). It would probably be more beneficial for
an E. coli strain to produce two different TEM mutants (an
extended-spectrum mutant and an inhibitor-resistant mutant)
simultaneously than to produce one double mutant. If each
mutant conferred its own resistance phenotype, high-level resistance
to both CA combinations and extended-spectrum
cephalosporins could then be expected.

β-Lactamases inactivate penicillin and cephalosporin antibiotics by hydrolysis of the β-lactam ring and are
an important mechanism of resistance for many bacterial pathogens. Four wild-type variants of Staphylococcus
aureus β-lactamase, designated A, B, C, and D, have been identified. Although distinguishable kinetically, they
differ in primary structure by only a few amino acids. Using the reported sequences of the A, C, and D enzymes
along with crystallographic data about the structure of the type A enzyme to identify amino acid differences
located close to the active site, we hypothesized that these differences might explain the kinetic heterogeneity
of the wild-type β-lactamases. To test this hypothesis, genes encoding the type A, C, and D β-lactamases were
modified by site-directed mutagenesis, yielding mutant enzymes with single amino acid substitutions. The
substitution of asparagine for serine at residue 216 of type A β-lactamase resulted in a kinetic profile
indistinguishable from that of type C β-lactamase, whereas the substitution of serine for asparagine at the
same site in the type C enzyme produced a kinetic type A mutant. Similar bidirectional substitutions identified
the threonine-to-alanine difference at residue 128 as being responsible for the kinetic differences between the
type A and D enzymes. Neither residue 216 nor 128 has previously been shown to be kinetically important
among serine-active-site β-lactamases.



β-Lactam antibiotics, including the penicillins and cephalosporins,
are important agents in the therapy of bacterial infections.
However, in some clinical settings the usefulness of these
agents has been diminished by the emergence and spread of
bacterial strains that produce β-lactamase, which hydrolyzes
the β-lactam ring and inactivates the drug's antimicrobial effect
(27). This problem has been demonstrated most dramatically
with Staphylococcus aureus. Whereas the vast majority of clinical
isolates of S. aureus were highly susceptible to penicillin G
at the time of its introduction into clinical use in the early
1940s, the spread of β-lactamase-producing, penicillin-resistant
strains was so widespread by the late 1940s that penicillin
G was no longer a reliable antistaphylococcal agent. Most
clinical isolates of S. aureus produce β-lactamase (36).

Four types of S. aureus β-lactamase have been identified by
serologic (34, 35) and kinetic (19, 20) methods. These variants
originally were designated types A, B, C, and D by Richmond
(34) and Rosdahl (35). This nomenclature should not be confused
with that of the different classes of β-lactamases, A
through D, that has been used more recently to group the
β-lactamases of all bacterial species on the basis of active site
(serine versus zinc), size, and kinetic characteristics (4). Each
of the four recognized types of S. aureus β-lactamase (A, B, C,
and D) is a class A β-lactamase with a serine active site. The
mature form of the enzyme has a molecular mass of 30 kDa,
contains 257 amino acids, and is excreted extracellularly (1).

The type A and C staphylococcal β-lactamases are easily
distinguished kinetically, either by substrate profile (18) or by
Km and catalytic constant (kcat) determinations (43). Whereas
the kinetics of hydrolysis exhibited by the type A and D staphylococcal
β-lactamases are similar (20), they can be rapidly and
reproducibly distinguished by a fivefold difference in the ratio
of the rates of the initial velocity of hydrolysis of penicillin G
and cefazolin (43). The lower penicillin G/cefazolin hydrolysis
ratio of the type D β-lactamase appears to be related to a lower
kcat for the hydrolysis of penicillin G by purified type D enzyme
than the other staphylococcal β-lactamases (43).

The genes encoding the type A, C, and D β-lactamases of S.
aureus are located on plasmids and have been cloned and
sequenced (5, 8, 9, 40). The deduced amino acid sequences
identify six amino acid differences in the primary sequence of
the prototypic type A β-lactamase from strains PC1 and S1
compared with the type C sequence from strain 3804, including
amino acids 116, 202, 205, 206, 212, and 216. In addition, there
are five differences, at amino acids 93, 121, 128, 226, and 229,
between the prototypic type A β-lactamase and the type D
β-lactamase produced by strain FAR4. The type C and D
enzymes differ by 11 amino acids.

The molecular basis for the kinetic heterogeneity exhibited
by the staphylococcal β-lactamases has not been determined.
However, the structure of the type A variant of S. aureus
β-lactamase is known from X-ray crystallographic analysis (11,
12), and several of the amino acids which are different between
the type A, C, and D β-lactamases are located close to the
active-site cleft. To test the hypothesis that single amino acid
differences at sites close to the active site are responsible for
the kinetic heterogeneity exhibited by naturally occurring variants
of S. aureus β-lactamase, we constructed mutant β-lactamases
with single amino acid substitutions at sites where the
wild-type enzymes differ and evaluated the kinetics of the mutant
enzyme.

TABLE 1. Plasmids and phagemids used in this study

Plasmid    Host                Resistance^a   Description                                                                          Source or reference

pS1        S. aureus           Amp            From strain S1, carries blaZ encoding type A β-lactamase                             18
pII3804    S. aureus           Amp            From strain 3804, carries blaZ encoding type C β-lactamase                           9
pUB101     S. aureus           Amp            From strain FAR4, carries blaZ encoding type D β-lactamase                           25
pE194      S. aureus           Em             From strain RN2442, carries gene encoding Em^r                                       13
pBC SK+    E. coli             Cm             Phagemid vector                                                                      Stratagene
pTZ18R     E. coli             Amp            Phagemid vector                                                                      Bio-Rad
pVK100     S. aureus-E. coli   Em, Cm         E. coli-S. aureus shuttle plasmid constructed by cloning 3.7-kb pE194                This study
                                              on a SacI site into SalI of pBC SK+
pVK101     S. aureus-E. coli   Em, Cm         pVK100 with a 4-kb EcoRI-HindIII fragment carrying β-lactamase                       This study
                                              regulatory genes from pI3796 but lacks blaZ
pVK102     E. coli             Amp            pTZ18R with 4-kb HindII fragment with type A blaZ from pS1                           This study
pVK103     E. coli-S. aureus   Amp, Em        pVK101 with 1.1-kb HindIII fragment with type A blaZ from pVK102                     This study
pVK104     E. coli             Amp            pBC SK+ with 8-kb EcoRI fragment with type C blaZ from pII3804                       This study
pVK105     E. coli-S. aureus   Amp, Em        pVK101 with 1.8-kb HindIII fragment with type C blaZ from pVK104                     This study
pVK106     E. coli-S. aureus   Amp, Em        pVK101 with 1.3-kb HindIII fragment with type D blaZ from pUB101                     This study
pVK107     E. coli-S. aureus   Amp, Em        pVK101 with 1.1-kb HindIII type A blaZ fragment except a single                      This study
                                              nucleotide substitution leading to S216N change
pVK108     E. coli-S. aureus   Amp, Em        pVK101 with 1.1-kb HindIII type A blaZ fragment except a single                      This study
                                              nucleotide substitution leading to T128A change
pVK109     E. coli-S. aureus   Amp, Em        pVK101 with 1.8-kb HindIII type C blaZ fragment except a single                      This study
                                              nucleotide substitution leading to N216S change
pVK110     E. coli-S. aureus   Amp, Em        pVK101 with 1.3-kb HindIII type D blaZ fragment except a single                      This study
                                              nucleotide substitution leading to A128T change

^a Amp, ampicillin; Cm, chloramphenicol; Em, erythromycin.



MATERIALS AND METHODS 

Chemicals and media. Standard powders of nitrocefin (BBL Microbiology 
Systems, Cockeysville, Md.): cephaloridine (Sigma Chemical Co., St. Louis, 
Mo.): methicillin, ampicillin, and cephapirin (Bristol Laboratories, Syracuse, 
N.Y.); and cefazolin and penicillin G (Eli Lilly & Co., Indianapolis, Ind.) were 
used to prepare antibiotic solutions for kinetic studies. Cation-exchange resin 
Pll (Whatman Laboratories, Kent, England) and meta-aminophenyl boronic 
acid hemisulfate and succinamide-activated sepharose (Sigma) were used for 
preparing columns for (S-lactamase purification. Restriction endonucleases, T4 
DNA ligase, and Sequenase were purchased from United States Biochemical 
Corp., Cleveland, Ohio. A Muta Gene Phagemid kit (Bio-Rad Laboratories, 
Fullerton, Calif.) was used for site-directed mutagenesis. The oligonucleotides 
required for site-directed mutagenesis and sequencing of the ^-lactamase gene 
were synthesized on a Cyclone Plus Automatic DNA synthesizer (Millipore, 
Bedford, Mass.) by the DNA Core Facility. Department of Molecular Physiology 
and Biophysics, Vanderbilt University. Modified 1% CY broth was prepared as 
described by Novick (29). LB and 2x YT media were prepared according to the 
methods of Maniatis el al. (26). Tryptic soy broth was purchased from BBL. 

Bacterial strains, plasmids, and cultivation. The plasmids and phagemids
used and constructed during this study are listed in Table 1. S. aureus RN4220
(R. P. Novick, Public Health Research Institute, New York, N.Y.) was used as a
recipient for protoplast transformation. Escherichia coli DH5α was used for the
transformation and propagation of E. coli plasmids and S. aureus-E. coli shuttle
plasmids except when indicated otherwise.

Construction of S. aureus β-lactamase expression vector. An E. coli-S. aureus
shuttle vector, pVK101, was constructed with erythromycin as a selectable
marker in S. aureus, by using a strategy similar to that employed by East et al. (8)
to construct S. aureus β-lactamase expression vector pAE706. First, S. aureus
plasmid pE194 was digested with SalI and cloned into the SacI site of pBC SK+
(Stratagene, La Jolla, Calif.) to create pVK100. Next, blaR (β-lactamase
regulatory gene) from S. aureus plasmid pI3796 was cloned on a 4-kb HindIII-EcoRI
fragment into pVK100 to produce pVK101.

Cloning of type A, C, and D blaZ. Large-scale isolation of plasmid DNA from
S. aureus by ultracentrifugation in a cesium chloride gradient (Varlacoid
Chemical Co., Inc., Bergenfield, N.J.) was performed by methods described by
Galetto et al. (10). A 4-kb Hindi fragment carrying the blaZ (β-lactamase
structural gene) from pS1 was cloned into pTZ18R to create pVK102. HindIII
digestion of pVK102 enabled the type A blaZ to be mobilized on a 1.1-kb
fragment which was cloned into pVK101. This produced pVK103, in which blaZ
and blaR are transcribed divergently from the same intracistronic region. A
similar strategy was used to clone the type C blaZ from pII3804 into pVK101 via
a pBC SK+ intermediate (pVK104) to produce pVK105. The plasmid DNA
from pUB101 was digested with HindIII, and a 1.3-kb fragment carrying type D
blaZ was cloned in the desired orientation into pVK101 to produce pVK106.

Site-directed mutagenesis of β-lactamase. Oligonucleotide-directed mutagenesis
was performed by the method of Kunkel et al. (23, 24). HindIII fragments
containing the type A, C, and D blaZ genes were cloned individually into the
phage vector M13mp18 (42). Two oligonucleotides, each with a single mismatch
(underlined), were used to introduce specific mutations at amino acids 128
(5'-TCACTATATGCCATTGAAGCC-3'; Thr to Ala) and 216
(5'-AGTATCTCCGTTTTTATTATT-3'; Ser to Asn) of the type A blaZ. Additional
primers were constructed to produce the reverse mutation in the type C and D
blaZ genes (i.e., 5'-AGTGTCTCCGCTTTTATTATT, Asn to Ser, type C blaZ, and
5'-ATCACTATATGTCATTGAAGC, Ala to Thr, type D blaZ). Single-stranded DNA
from six randomly picked plaques was sequenced by the dideoxy chain termination
method (38) using Sequenase enzyme to confirm the desired mutation. The
complete open reading frame was sequenced in order to rule out the presence of
any spurious mutations.

Expression of wild-type and mutant β-lactamases in S. aureus RN4220. Wild-type
blaZ encoding type A, C, and D β-lactamases and mutant blaZ genes from
M13mp18 recombinants were cloned as 1.1- to 1.8-kb HindIII fragments into the
HindIII site of S. aureus-E. coli shuttle plasmid pVK101. Recombinants were
selected on LB medium containing ampicillin (100 μg/ml) and chloramphenicol
(30 μg/ml). The desired orientation of the insert was verified either by restriction
with EcoRV, which has a single site within type A and D blaZ and one just
upstream of the blaZ promoter within blaR, or by PCR. Protoplast transformation
was used to transfer the recombinant shuttle plasmids into S. aureus RN4220
(6).

β-Lactamase purification from S. aureus. The wild-type β-lactamase-producing
strains and the RN4220 transformants were grown in 5 liters of modified 1%
CY medium containing methicillin (0.5 μg/ml) or
2-(2'-carboxyphenyl)benzoyl-6-aminopenicillanic acid (7.5 μM, to induce
β-lactamase production) at 37°C and 150 rpm for 18 h. The extracellular
β-lactamases were purified by sequential cation-exchange and affinity
chromatography (21).

FIG. 1. Tertiary structure of S. aureus PC1 β-lactamase. The active-site cleft
of the β-lactamase is located at the left side of the β3 strand. The active-site
amino acids are S-70, S-130, N-132, K-234, and R-244. Amino acids T-128 and
S-216 were substituted by site-directed mutagenesis of the cloned structural
gene. The diagram was generated by the computer software MOLSCRIPT (22)
using X-ray crystallographic coordinates stored in the Brookhaven Protein
Database (accession no. 3BLM).

Enzyme kinetics. Initial velocities of hydrolysis were monitored at a wavelength
corresponding to the maximal change in absorbance between the unhydrolyzed
substrate and the hydrolyzed product, as follows: cephaloridine, 254 nm;
cefazolin, 272 nm; nitrocefin, 482 nm; cephapirin, 258 nm; ampicillin, 235 nm;
and penicillin G, 232 nm (37, 39). β-Lactamase assays were performed in 0.1 M
sodium phosphate buffer, pH 6.0, in 1-cm cuvettes at 37°C with a DU-70
recording spectrophotometer (Beckman Instruments, Fullerton, Calif.). For Km
and kcat determinations, assays of the initial velocity of hydrolysis were
performed using 100, 50, 33.3, 20, 14.2, and 11.1 μM solutions of each
cephalosporin antibiotic. Penicillin hydrolysis assays were performed at
initial substrate concentrations of 1,000, 500, 333, 200, 142, and 111 μM. The
maximal rate of hydrolysis, Vmax, and the Michaelis constant, Km, for each
substrate-enzyme combination were determined from [s]/v-against-[s] plots
(where [s] is substrate concentration and v is velocity) (41) with computer
software (Hyper; Department of Biochemistry, University of Liverpool,
Liverpool, United Kingdom). The turnover number, kcat, was calculated from the
Vmax by using a molecular mass of the purified β-lactamase of 30,000 g/mol.
Mean values and standard errors of the mean were calculated from the results of
8 to 12 [s]/v-against-[s] plots for each enzyme-substrate combination by using
computer software (Minitab Data Analysis Software, release 10.2; Minitab, Inc.,
State College, Pa.).
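
The [s]/v-against-[s] (Hanes-Woolf) analysis described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it uses an unweighted least-squares fit rather than the Hyper program cited in the text, the velocities are synthetic values generated from assumed Km and Vmax, and the function names and the enzyme concentration are introduced here for the example.

```python
# Hanes-Woolf linearization: [s]/v = [s]/Vmax + Km/Vmax.
# Illustrative sketch only; not the Hyper software used in the study.

def hanes_woolf_fit(s, v):
    """Return (Km, Vmax) from an unweighted straight-line fit of [s]/v against [s]."""
    x = list(s)
    y = [si / vi for si, vi in zip(s, v)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx                      # slope = 1/Vmax
    intercept = mean_y - slope * mean_x    # intercept = Km/Vmax
    vmax = 1.0 / slope
    km = intercept * vmax
    return km, vmax


def kcat_from_vmax(vmax_um_per_s, enzyme_mg_per_l, molar_mass=30000.0):
    """Turnover number (s^-1) from Vmax (uM/s) and enzyme mass concentration."""
    enzyme_um = enzyme_mg_per_l / molar_mass * 1000.0  # mg/liter -> uM
    return vmax_um_per_s / enzyme_um


# Substrate concentrations used for the cephalosporins (uM).
S_UM = [100, 50, 33.3, 20, 14.2, 11.1]

# Synthetic, noise-free velocities from ASSUMED parameters (not measured data).
TRUE_KM, TRUE_VMAX = 13.1, 2.0
V_UM_PER_S = [TRUE_VMAX * s / (TRUE_KM + s) for s in S_UM]

km, vmax = hanes_woolf_fit(S_UM, V_UM_PER_S)
print(f"Km = {km:.1f} uM, Vmax = {vmax:.2f} uM/s")
# Enzyme concentration of 0.03 mg/liter is a hypothetical value.
print(f"kcat = {kcat_from_vmax(vmax, enzyme_mg_per_l=0.03):.0f} s^-1")
```

With real data the points scatter about the line, which is why the study averaged 8 to 12 plots per enzyme-substrate combination.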

RESULTS 

Construction and expression of mutant β-lactamases. The primary goal of this
study was to determine the functional domains, particularly the specific amino
acid(s), responsible for the kinetic heterogeneity observed among naturally
occurring variants of S. aureus β-lactamase. Mutant β-lactamases with single
amino acid substitutions at residues 128 (Thr to Ala) and 216 (Ser to Asn) were
constructed by using oligonucleotide-directed mutagenesis and the type A
β-lactamase gene (blaZ) as a template. In addition, the reverse mutations
were introduced by using the type D (Ala to Thr, residue 128) and type C (Asn
to Ser, residue 216) blaZ genes as template DNA. These residues were selected
primarily because, of the amino acids that differ among the type A, C, and D
β-lactamases, residues 128 and 216 are closest to the active-site cleft (Fig.
1). Each blaZ was cloned into E. coli-S. aureus shuttle plasmid pVK101, which
was then transformed into S. aureus RN4220. β-Lactamase production was induced,
and the wild-type β-lactamases expressed by the reference strains and the
mutant enzymes expressed in RN4220 were purified to homogeneity.

Effect of substitution at amino acid 216 on the kinetics of type A and C
β-lactamases. β-Lactams which have been shown previously to be useful in
distinguishing between the wild-type S. aureus β-lactamases (20) were used to
compare the kinetics of hydrolysis of the reference and mutant enzymes (Tables
2 and 3). Between the type A and type C β-lactamases there was a 10-fold
difference in the Km values of cefazolin, a 5-fold difference in the Km values
of cephapirin, and a 5-fold difference in the kcat values of nitrocefin. These
differences appear to be due to whether Ser or Asn was present at residue 216.
Replacement of Ser by Asn in the type A β-lactamase yielded a mutant (A, S216N)
that was closer kinetically to the type C than the type A enzyme (e.g.,
cefazolin Km values: 167 μM, mutant; 145 μM, type C; 13.1 μM, type A). The
reverse mutation using type C blaZ DNA for site-directed mutagenesis yielded a
mutant (C, N216S) that was similar to the type A enzyme (cefazolin Km value,
15.3 μM). Also, the differences between the kinetic type A and type C S. aureus
β-lactamases are clearly shown by comparing relative efficiency of hydrolysis
values (Table 4). The kcat values of most substrates other than nitrocefin did
not clearly distinguish between the type A and type C enzymes (Table 3).
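
The relative efficiency of hydrolysis referred to above is kcat/Km expressed as a percentage of the cephaloridine value for the same enzyme. A minimal sketch of that calculation follows, using the mean Km and kcat values for the type A and type C enzymes transcribed from Tables 2 and 3. The function and variable names are assumptions made for this example, and because the inputs are rounded means, the results agree with Table 4 only to within a few percent.

```python
# Mean Km (uM) and kcat (s^-1) values transcribed from Tables 2 and 3.
KM = {
    "type A": {"cephaloridine": 4.3, "cefazolin": 13.1, "cephapirin": 4.5,
               "nitrocefin": 5.1, "ampicillin": 195, "penicillin G": 29.2},
    "type C": {"cephaloridine": 6.3, "cefazolin": 145, "cephapirin": 24.9,
               "nitrocefin": 10.5, "ampicillin": 128, "penicillin G": 25.5},
}
KCAT = {
    "type A": {"cephaloridine": 1.5, "cefazolin": 1.4, "cephapirin": 0.37,
               "nitrocefin": 33.7, "ampicillin": 560, "penicillin G": 254},
    "type C": {"cephaloridine": 2.6, "cefazolin": 1.8, "cephapirin": 0.6,
               "nitrocefin": 6.9, "ampicillin": 249, "penicillin G": 140},
}


def relative_efficiency(kcat, km, reference="cephaloridine"):
    """kcat/Km for each antibiotic as a percentage of the reference antibiotic."""
    ref = kcat[reference] / km[reference]
    return {drug: 100.0 * (kcat[drug] / km[drug]) / ref for drug in kcat}


for enzyme in ("type A", "type C"):
    reh = relative_efficiency(KCAT[enzyme], KM[enzyme])
    print(enzyme, {drug: round(value) for drug, value in reh.items()})
```

The roughly 10-fold drop in the cefazolin value between the type A and type C enzymes is driven almost entirely by the Km term, in line with the text.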

TABLE 2. Km values of β-lactam antibiotics for purified wild-type and mutant β-lactamases of S. aureus^a

Mean Km [μM (SEM)] for β-lactamase:

               Wild type                              Altered by site-directed mutagenesis
Antibiotic     pS1,         pII3804,     pUB101,      pVK107       pVK108       pVK109       pVK110
               type A       type C       type D       (A, S216N)   (A, T128A)   (C, N216S)   (D, A128T)
Cephaloridine  4.3 (1.0)    6.3 (1.8)    4.6 (1.2)    6.5 (1.7)    5.3 (1.3)    4.5 (0.7)    4.8 (1.1)
Cefazolin      13.1 (2.8)   145 (18.7)   38.3 (9.3)   167 (35)     43.9 (6.6)   15.3 (2.7)   13.3 (1.8)
Cephapirin     4.5 (1.2)    24.9 (1.7)   7.1 (1.7)    27.3 (3.9)   7.0 (0.8)    5.5 (1.5)    4.5 (1.0)
Nitrocefin     5.1 (1.3)    10.5 (2.3)   7.1 (1.6)    11.6 (2.3)   6.8 (1.5)    6.7 (1.5)    6.6 (1.7)
Ampicillin     195 (35)     128 (22)     119 (22)     153 (14)     118 (15)     208 (31)     219 (43)
Penicillin G   29.2 (6.5)   25.5 (6.9)   26.0 (8.2)   29.0 (9.1)   34.3 (9.4)   31.3 (8.6)   19.7 (6.5)

^a The initial velocities of hydrolysis of 100, 50, 33.3, 20, 14.2, and 11.1 μM solutions of the cephalosporins and 1,000, 500, 333, 200, 142, and 111 μM solutions of
ampicillin and penicillin G were monitored with a recording spectrophotometer. Km values were determined by the use of [s]/v-against-[s] plots. Each value represents
the mean from 8 to 12 determinations. Km values that are altered by amino acid replacements are in boldface.

TABLE 3. kcat values of β-lactam antibiotics for purified wild-type and mutant β-lactamases of S. aureus

Mean kcat value [s^-1 (SEM)]^a for β-lactamase:

               Wild type                              Altered by site-directed mutagenesis
Antibiotic     pS1,         pII3804,     pUB101,      pVK107       pVK108       pVK109       pVK110
               type A       type C       type D       (A, S216N)   (A, T128A)   (C, N216S)   (D, A128T)
Cephaloridine  1.5 (0.3)    2.6 (0.2)    0.97 (0.2)   3.4 (0.4)    0.83 (0.1)   0.74 (0.17)  1.37 (0.1)
Cefazolin      1.4 (0.1)    1.8 (0.1)    2.1 (0.1)    2.51 (0.3)   2.24 (0.1)   0.86 (0.1)   0.92 (0.1)
Cephapirin     0.37 (0.01)  0.6 (0.01)   0.21 (0.02)  0.75 (0.1)   0.19 (0.03)  0.27 (0.02)  0.36 (0.01)
Nitrocefin     33.7 (4.6)   6.9 (0.4)    16.3 (2.0)   8.0 (0.9)    15.3 (3.4)   33.1 (4.1)   37.0 (4.4)
Ampicillin     560 (49)     249 (14)     133 (23)     353 (28)     91 (10)      392 (33)     423 (46)
Penicillin G   254 (17)     140 (5)      65.6 (6)     162 (14)     47.3 (6.8)   173 (16)     192 (12)

^a Mean value in molecules of antibiotic hydrolyzed per second per molecule of enzyme, as determined from [s]/v-against-[s] plots and assuming a molecular mass of
the purified β-lactamase of 30,000 g/mol. Each value is derived from 8 to 12 determinations. kcat values that are altered by amino acid replacements are in boldface.

Effect of substitution at amino acid 128 on the kinetics of type A and D
β-lactamases. The reference type A β-lactamase had a cefazolin Km value that
was threefold lower than and an ampicillin Km value that was twofold greater
than the respective Km values of the type D enzyme. In addition, the kcat
values of ampicillin and penicillin G were three- to fourfold higher with the
type A than with the type D β-lactamase. These differences were related to
whether Ala or Thr was present at residue 128. Replacement of Thr by Ala in the
type A β-lactamase yielded a mutant (A, T128A) that was closer kinetically to
the type D than the type A enzyme (e.g., penicillin G kcat values: 47 s^-1,
mutant; 254 s^-1, type A; 66 s^-1, type D). The reverse mutation using type D
blaZ DNA for site-directed mutagenesis yielded a mutant (D, A128T) that was
similar to the type A enzyme (penicillin G kcat value, 192 s^-1). The type A
and D enzymes are easily distinguished by the ratio of the rates of hydrolysis
of penicillin G and cefazolin (Table 5).
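
The penicillin G to cefazolin ratio mentioned above can be recomputed directly from the mean kcat values in Table 3. The short sketch below does this for the four enzymes listed in Table 5; the names are assumptions made for the example, and since the inputs are rounded means, the ratios differ slightly from the published relative kcat ratios.

```python
# Mean kcat values (s^-1) transcribed from Table 3.
KCAT_PENICILLIN_G = {
    "pS1 (A)": 254, "pUB101 (D)": 65.6,
    "pVK108 (A, T128A)": 47.3, "pVK110 (D, A128T)": 192,
}
KCAT_CEFAZOLIN = {
    "pS1 (A)": 1.4, "pUB101 (D)": 2.1,
    "pVK108 (A, T128A)": 2.24, "pVK110 (D, A128T)": 0.92,
}


def kcat_ratio(enzyme):
    """Ratio of penicillin G kcat to cefazolin kcat for one enzyme."""
    return KCAT_PENICILLIN_G[enzyme] / KCAT_CEFAZOLIN[enzyme]


for name in KCAT_PENICILLIN_G:
    print(f"{name}: {kcat_ratio(name):.1f}")
```

Enzymes with Thr at residue 128 give ratios near 200, whereas those with Ala give ratios in the range of 20 to 30, which is the separation the text relies on.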

DISCUSSION 

Naturally occurring variants of S. aureus β-lactamase can be distinguished on
the basis of the kinetics of hydrolysis of selected penicillin and
cephalosporin antibiotics (20, 43). In this study we have shown that these
kinetic differences are determined by single amino acid substitutions at
positions close to the active site of the enzyme. Specifically, the presence of
an Asn instead of Ser at residue 216 determines a type C kinetic profile, and
the presence of an Ala instead of Thr at residue 128 determines a type D
kinetic profile. Enzymes exhibiting the type A kinetic profile have Ser and Thr
at these two sites, respectively. We also found that substitutions at some
positions other than 128 and 216 where the type A, C, and D enzymes have
different amino acids did not alter the kinetic profile of the enzyme (e.g.,
amino acid 121 [data not shown]).

This situation is reminiscent of what has been reported for the newer TEM-type
β-lactamases and SHV β-lactamase variants among gram-negative bacterial species
(17). Broad-spectrum TEM and SHV variant β-lactamases capable of hydrolyzing
ceftazidime, cefotaxime, and/or other newer cephalosporins have become
problematic in many medical centers in recent years, especially among isolates
of Klebsiella pneumoniae and E. coli (14, 17, 31). As with the S. aureus
enzymes, the altered kinetic profile of the broad-spectrum TEM-type
β-lactamases is based on modest changes in primary structure, generally one to
three amino acid substitutions at sites close to the active site (32). In
addition, single mutations that result in TEM and SHV variant β-lactamases
exhibiting resistance to commercially available β-lactamase inhibitors such as
clavulanic acid have been described (7, 15).

A major difference between the histories of the variant TEM and staphylococcal
enzymes, however, is that whereas the former appear to be a consequence of
selective antibiotic pressure in the clinical setting, the S. aureus enzymes
appear to have remained remarkably stable over time. Although
β-lactamase-producing strains of S. aureus spread widely during the first few
years following the clinical introduction of penicillin, some clinical isolates
collected and saved prior to penicillin use have been shown to produce the A
and C variants of S. aureus β-lactamase. Furthermore, the prevalence of the
various types of staphylococcal β-lactamases among clinical isolates in the
United States in the 1980s (19, 20) is similar to what was reported in England
by Richmond in the mid-1960s when he first described the existence of different
staphylococcal β-lactamase serotypes (34). Despite the widespread use since the
early 1960s of antistaphylococcal penicillins such as methicillin, oxacillin,
and nafcillin, new staphylococcal β-lactamases capable of hydrolyzing these
agents efficiently have not been observed.

TABLE 4. Relative efficiencies of hydrolysis for β-lactam antibiotics of purified wild-type and mutant β-lactamases of S. aureus

REH^a (%) of β-lactamase:

               Wild type                              Altered by site-directed mutagenesis
Antibiotic     pS1,         pII3804,     pUB101,      pVK107       pVK108       pVK109       pVK110
               type A       type C       type D       (A, S216N)   (A, T128A)   (C, N216S)   (D, A128T)
Cephaloridine  100          100          100          100          100          100          100
Cefazolin      32           3            24           3            28           24           25
Cephapirin     24           6            14           5            17           30           29
Nitrocefin     1,943        159          1,095        133          1,406        1,980        2,004
Ampicillin     820          473          562          449          481          868          690
Penicillin G   2,565        1,336        1,200        1,071        863          2,208        3,578

^a Relative efficiency of hydrolysis (REH) is a relative value of kcat/Km. These values are expressed as a percentage of the REH for cephaloridine by the same
β-lactamase (i.e., the cephaloridine REH was calculated in liters per mole per second and assigned a value of 100), determined from the mean kcat and Km values for
each enzyme-substrate combination. REH values that are altered by amino acid replacements are in boldface.

TABLE 5. Ratios of the rates of hydrolysis of penicillin G and cefazolin by kinetic type A and D β-lactamases

β-Lactamase (type)                     Relative kcat ratio^a    Fixed-concn ratio^b
Wild type
  pS1 (A)                              178.7                    183.5
  pUB101 (D)                           32.5                     36.6
Altered by site-directed mutagenesis
  pVK108 (A, T128A)                    23.6                     45.7
  pVK110 (D, A128T)                    208.5                    162.5

^a Ratio of the kcat values of penicillin G and cefazolin.
^b Determined from the initial velocities of hydrolysis using a 500 μM concentration of penicillin G and 100 μM concentration of cefazolin.

Neither of the amino acid positions that we have shown to be responsible for
the kinetic differences among the wild-type S. aureus enzymes has previously
been cited as contributing to β-lactamase function. The major kinetic
difference between the type A and C enzymes lies in the Km values of certain
cephalosporins, particularly cefazolin and cephapirin (Table 2). These enzymes
differ not only in hydrolysis of certain cephalosporin substrates but also in
the inhibition profile of some β-lactamase inhibitors, including sulbactam
(unpublished observations) and tazobactam (3), with the type A enzyme being
more susceptible to inhibition.

The substitution at residue 216 might affect β-lactamase structure and function
in several ways. First, the side chain of Asn is bulkier (the accessible
surface area of Asn is 158 Å² [1.58 nm²] [28]) than that of Ser (122 Å²
[1.22 nm²]) and may hinder substrate binding in the active-site cleft.
Modelling studies with cefazolin docked into the PC1 β-lactamase active-site
cleft show that the C-3 side chain substituent of cefazolin is positioned close
to the side chain of Ser-216 (unpublished observations), and substitution of
Asn for Ser at this position would result in steric hindrance. Second, the
refined crystal structure of PC1 β-lactamase at 2 Å (0.2 nm) indicated that
amino acid 216 is located on a short 3₁₀ helix comprising amino acids 215 to
217 and that this helix is stabilized through helix N capping (33) between
Asn-214, which is highly conserved among class A β-lactamases (2), and Ser-216
(O-N, 2.9 Å [0.29 nm]). In addition, the side chain OH of Ser-216 is involved
in a hydrogen bond with the Asn-214 side chain carbonyl (O-O, 3.3 Å [0.33 nm]).
Substitution of Asn for Ser at 216 may alter the topology of this short helix.
The crystal structure of a type C enzyme could help to clarify the structure of
this loop, and attempts are under way to crystallize the type C enzyme. Third,
amino acid 216 is located close to the β3 strand, and changes at this residue
might alter the relative positioning of other active-site amino acids such as
the K-T-G triad.

The reason why the replacement of Thr by Ala at residue 128 should affect
enzyme function is less clear. The effect of the substitution was primarily a
reduction in the kcat of the penicillins and nitrocefin along with modest
effects on the Km values of cefazolin and ampicillin (Table 2). The crystal
structure of PC1 indicates that amino acid 128 is located at the C terminus of
α-helix 4 close to the active-site cleft. It is two residues away from the
highly conserved S-D-N loop (amino acids 130 to 132) of the class A
β-lactamases. The catalytic function of the S-D-N loop has been verified by
site-directed mutagenesis of Streptomyces albus G β-lactamase (16) and E. coli
TEM β-lactamase (30). The proximity of amino acid 128 to the catalytically
important S-D-N loop might explain the kinetic differences between the type A
and type D S. aureus β-lactamases. In preliminary experiments, replacement of
residue 128 by other amino acids has also been shown to affect the kinetics of
hydrolysis (unpublished observations).

In conclusion, naturally occurring type A, C, and D variants of S. aureus
β-lactamase exhibit kinetic differences due to single amino acid differences at
positions close to the active site which have not previously been shown to be
involved in enzymatic activity. It is likely that the substitution at amino
acid 128 affects enzyme function by altering the structure of the catalytically
important S-D-N loop. The differences between the type A and type C enzymes
could be due to steric hindrance to substrate binding and/or some structural
stabilization effects. This has to be verified by kinetic studies with mutant
enzymes in which amino acids 128 and 216 are substituted with different amino
acids as well as by molecular modelling studies with different β-lactam
substrates.
ABSTRACT An efficient β-fucosidase was evolved by DNA shuffling from the
Escherichia coli lacZ β-galactosidase. Seven rounds of DNA shuffling and colony
screening on chromogenic fucose substrates were performed, using 10,000
colonies per round. Compared with native β-galactosidase, the evolved enzyme
purified from cells from the final round showed a 1,000-fold increased
substrate specificity for o-nitrophenyl fucopyranoside versus o-nitrophenyl
galactopyranoside and a 300-fold increased substrate specificity for
p-nitrophenyl fucopyranoside versus p-nitrophenyl galactopyranoside. The
evolved cell line showed a 66-fold increase in p-nitrophenyl fucosidase
specific activity. The evolved fucosidase has a 10- to 20-fold increased
kcat/Km for the fucose substrates compared with the native enzyme. The DNA
sequence of the evolved fucosidase gene showed 13 base changes, resulting in
six amino acid changes from the native enzyme. This effort shows that the
library size that is required to obtain significant enhancements in specificity
and activity by reiterative DNA shuffling and screening, even for an enzyme of
109 kDa, is within range of existing high-throughput technology. Reiterative
generation of libraries and stepwise accumulation of improvements based on
addition of beneficial mutations appears to be a promising alternative to
rational design.



Proteins and enzymes with novel functions and properties can 
be obtained either by searching the largely unknown natural 
species or by improving upon currently known natural proteins 
or enzymes. The latter approach may be more suitable for 
creating properties for which natural evolutionary processes 
are unlikely to have been selected. 

One promising strategy to create such novel properties is by 
directed molecular evolution. Starting with known natural 
protein(s), multiple rounds of mutagenesis, functional screen- 
ing, and amplification can be carried out. When the mutation 
rate, library size, and selection pressures are properly bal- 
anced, the desired phenotype of a protein generally increases 
with each round (1-8). The advantage of such a process is that 
it can be used to rapidly evolve any protein, without any 
knowledge of its structure. 

A number of different mutagenesis strategies exist, such as oligonucleotide
cassette mutagenesis, point mutagenesis by error-prone PCR or the use of
mutator strains, as well as DNA shuffling (1-5, 8). A theoretical approach to
choosing a preferred mutagenesis strategy would be to determine the target
protein's fitness landscape (9), which is a plot of fitness (on the y axis)
versus sequence space (on the x axis). However, because the sequence space of
an average protein of 500 amino acids is 20^500, determination of even a
fraction of the fitness landscape is a nearly impossible and impractical
undertaking. Because there are just a few fundamental ways to search sequence
space, it may be informative to compare the performance of these methods for
specific model systems.
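
To give a sense of scale for the sequence-space argument above, the following sketch computes the size of the space for a 500-residue protein and the fraction of it covered by one library of 10,000 clones (the library size used per round in this study). The variable names are introduced here for illustration only.

```python
import math

RESIDUES = 500      # length of the "average protein" in the text
ALPHABET = 20       # number of standard amino acids
LIBRARY = 10_000    # clones screened per round in this study

space = ALPHABET ** RESIDUES                       # exact integer, 20^500
log10_space = RESIDUES * math.log10(ALPHABET)      # order of magnitude
log10_fraction = math.log10(LIBRARY) - log10_space

print(len(str(space)), "decimal digits in 20^500")
print(f"log10(sequence space) = {log10_space:.1f}")
print(f"log10(fraction sampled per round) = {log10_fraction:.1f}")
```

Any practical library therefore samples a vanishingly small part of the landscape, which is why the choice of search strategy matters more than library size alone.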

Natural genes are thought to have evolved by mutation and recombination within
a population of diverse, but highly related, sequences. We suggest that a
search algorithm similar to that which slowly created the fitness landscape of
a natural protein in the first place is likely to also be the preferred method
for further searching this natural sequence landscape (5, 10). This approach is
supported by our demonstration of the advantage of recombining mutations (over
introduction of point mutations alone) for increasing the activity of a natural
β-lactamase protein (2). However, recombination may not always be the best
search algorithm. For searching the fitness landscapes of nonnatural sequences
under unusual conditions, it is conceivable that a different approach may be
preferable.

We obtain in vitro recombination of infrequent point mu- 
tations by a PCR-based technique called DNA shuffling (1-5). 
A pool of closely related sequences is fragmented randomly, 
and these fragments are reassembled into full-length genes via 
self-priming PCR and extension in a process we call reassembly 
PCR (4). This process yields crossovers between related se- 
quences due to template switching. Shuffling allows rapid 
combination of positive-acting mutations and simultaneously 
flushes out negative-acting mutations from the sequence pool 
(Fig. 1). When coupled with effective selection and applied 
reiteratively, such that the output of one cycle is the input for 
the next cycle, reiterative DNA shuffling has been demon- 
strated to be an efficient process for directed molecular 
evolution (1-3). 
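
The logic of reiterative shuffling and screening described above can be caricatured in software. The sketch below is a toy model under strong assumptions, not the laboratory procedure: parents are equal-length aligned strings, "reassembly" is modelled as switching template at random crossover points, and the fitness function, parent sequences, library size, and all function names are hypothetical.

```python
import random


def reassemble(parents, n_crossovers, rng):
    """Build one chimera by switching template at random crossover points."""
    length = len(parents[0])
    if any(len(p) != length for p in parents):
        raise ValueError("toy model assumes equal-length, aligned parents")
    points = sorted(rng.sample(range(1, length), n_crossovers))
    pieces, start = [], 0
    for end in points + [length]:
        template = rng.choice(parents)      # template switch at each segment
        pieces.append(template[start:end])
        start = end
    return "".join(pieces)


def shuffle_round(pool, library_size, keep, fitness, rng, n_crossovers=3):
    """One cycle: recombine the pool, screen the library, keep the best clones."""
    library = [reassemble(pool, n_crossovers, rng) for _ in range(library_size)]
    return sorted(library, key=fitness, reverse=True)[:keep]


def score(seq):
    """Hypothetical fitness: beneficial minus deleterious mutations."""
    return seq.count("+") - seq.count("-")


# Hypothetical parents: '+' beneficial mutation, '-' deleterious, '.' wild type.
PARENTS = ["+....-....", "....+....-", "-........+"]

rng = random.Random(0)
pool = PARENTS
for _ in range(3):  # output of one cycle is the input for the next
    pool = shuffle_round(pool, library_size=200, keep=5, fitness=score, rng=rng)
print(pool[0], score(pool[0]))
```

Even in this crude model, recombination can bring beneficial changes from different parents into one clone while dropping deleterious ones, which point mutagenesis alone cannot do in a single step.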

In our previous shuffling studies we used selection and/or large libraries (1,
2). Our primary goal in this work was to determine whether detection by
screening of libraries of 10,000 clones, a number that is within range of any
high-throughput screening procedure, would be sufficient to obtain significant
enhancement of a minor activity of β-galactosidase, a highly specific and
complex enzyme and, at 109 kDa, one of the largest single-chain proteins in
Escherichia coli. If screening detected significant improvement, this would
establish that improvements are obtainable by evolution with such small
libraries.

E. coli β-galactosidase, encoded by lacZ (11), is widely used, and its
biological function, catalytic mechanism, and molecular structures are well
characterized (11-15). It is a tetramer of identical subunits of 1,023 amino
acids (13, 16, 17). The crystal structure of β-galactosidase has been solved
and shows that each subunit forms five structural domains (14). Each active
site resides mainly in one subunit, but part of another subunit also is
involved (14). The native enzyme hydrolyzes β-galactosyl linkages, such as the
β(1,4)-linkage in its natural disaccharide substrate, lactose. The native
β-galactosidase is known to be highly specific for β-D-galactosyl substrates. A
multistep model of the reaction was proposed (18, 19) based on kinetic studies
of the native enzyme for o-nitrophenyl β-D-galactopyranoside (ONPG),
p-nitrophenyl β-D-galactopyranoside (PNPG), and other substrates and substrate
analogs. The native β-galactosidase acts only weakly on β-D-fucosyl moieties
(18-20) and does not act on most substrate analogs.

Abbreviations: ONPG, o-nitrophenyl β-D-galactopyranoside; ONPF,
o-nitrophenyl β-D-fucopyranoside; PNPG, p-nitrophenyl β-D-galactopyranoside;
PNPF, p-nitrophenyl β-D-fucopyranoside; X-Fuc,
5-bromo-4-chloro-3-indolyl β-D-fucopyranoside.
*To whom reprint requests should be addressed, e-mail:
maxygen@maxygen.com.

Fig. 1. Schematic illustration of the DNA shuffling process used in the
present study.

MATERIALS AND METHODS 

E. coli β-galactosidase (EC 3.2.1.23) and the galactosyl and fucosyl substrates
5-bromo-4-chloro-3-indolyl β-galactopyranoside (X-Gal), PNPG, ONPG,
5-bromo-4-chloro-3-indolyl β-D-fucopyranoside (X-Fuc), p-nitrophenyl
β-D-fucopyranoside (PNPF), and o-nitrophenyl β-D-fucopyranoside (ONPF) were
purchased from Sigma. Plasmid pCH110 containing a lacZ gene was from Pharmacia.
E. coli strain TB1 was a gift from Charles Roessner of Texas A&M University.

Construction of Plasmid p18lacZ. A 3.8-kb HindIII-BamHI restriction fragment
from pCH110 containing a lacZ gene (codon 8 fused to a short N-terminal
peptide) and the gpt promoter region (21) was subcloned into the HindIII and
BamHI sites of the p18-sfi-kan-sfi vector, a 2.3-kb pUC18 derivative in which
the ampicillin gene is replaced by a kanamycin phosphotransferase gene (2). The
resulting plasmid, named p18lacZ, was used for DNA shuffling. DNA fragments of
50-200 bp were used and reassembled as described previously (1, 2). The PCR
primers for amplification of the reassembled genes were
AGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCC (forward) and
CTATGCGGCATCAGAGCAGATTGTACTGAGAGTGCACCAT (reverse), located on either side of
the BamHI and HindIII fragment. The reassembled gene was digested with
restriction enzymes HindIII and BamHI and ligated back into the
p18-sfi-kan-sfi vector. The ligation mixture was electroporated into E. coli
TB1 competent cells and plated out on Luria-Bertani plates (150 mm) with 40
μg/ml kanamycin and 2 mg/plate of the X-Fuc substrate (22). The plates were
incubated 12 to 24 hr at 37°C. The resulting kanamycin-resistant transformants
were visually screened for the intensity of the blue color. The 20-40 colonies
with the most intense blue color were picked from about 10,000 transformants of
each round and used for the next round of DNA shuffling. Seven rounds of DNA
shuffling and screening were carried out. The best clone from the final
screening round, called evolved β-fucosidase, was characterized in detail.

Enzyme Purification. For purification of the native β-galactosidase and the
evolved β-fucosidase, a histidine tag (His6) was fused to the N terminus of
both enzymes by PCR with two primers
[5'-(P)CATCACCATCACCACCATATCGTCACCTGGGACATGT and
5'-(P)GTATTTTTCGCTCATGTGAA] in a standard PCR. The histidine-tagged native and
evolved enzymes were purified from overnight TB1 cell cultures harboring the
corresponding plasmid (23). The crude cell extract, in 50 mM phosphate (pH 7.0)
with 100 mM NaCl and 0.2 mM phenylmethylsulfonyl fluoride protease inhibitor,
was passed through a 20-ml Ni-nitrilotriacetic acid agarose (Qiagen) column.
The bound protein was stepwise-eluted with the same buffer containing 5 mM, 10
mM, 25 mM, and 100 mM imidazole. The active fractions from the metal affinity
column were desalted and loaded on a DEAE column in 20 mM Tris (pH 7.5),
followed by elution with a 0 to 1 M NaCl gradient. The active fractions were
concentrated and loaded on a Superose 12 gel filtration column in an FPLC
protein purification unit (Pharmacia). SDS/PAGE analysis (data not shown)
showed that the native galactosidase and the evolved fucosidase were greater
than 90% pure.

Enzyme Kinetics. β-Galactosidase activity was assayed using 
the synthetic chromogenic substrates ONPG and PNPG. β-Fucosidase 
activity was assayed using the chromogenic fucosyl substrates 
ONPF and PNPF. Enzyme assays were performed at 
25°C and pH 7.0 in 30 mM N-tris(hydroxymethyl)methylaminoethanesulfonic 
acid with 1 mM MgCl2 and 150 mM NaCl. 
The absorbance change at 420 nm was recorded with time, and 
product formation was quantitated using the absorption extinction 
coefficient (2.65 mM⁻¹·cm⁻¹ for o-nitrophenol and 
6.7 mM⁻¹·cm⁻¹ for p-nitrophenol). For kinetic parameter 
measurements, the initial velocity V0 (when less than 10% of 
the substrate was converted into product) was determined with 
varied substrate concentrations. The values of Vmax and Km 
were calculated using the simple weighting method of Cornish-Bowden 
(24). The Vmax values were converted to kcat values, the 
turnover number per active site, by normalizing for the enzyme 
concentrations by the molecular mass of the monomer. The Km 
and kcat values of the wild-type β-galactosidase for ONPF 
could not be determined directly because of the low activity on 
this substrate. The kcat/Km value had to be estimated from the 
enzyme dilution factor required for the native enzyme to 
generate the same amount of o-nitrophenol product from 
ONPG after the same period of time (usually, several hours) 
and from the kcat/Km value of the wild-type enzyme on ONPG. 
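
The unit conversions described in this section can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the 1-cm path length, the function names, and the idea of one active site per monomer expressed this way are ours, and the fitting of Vmax and Km by the Cornish-Bowden method is not reproduced here. Only the two extinction coefficients come from the text.

```python
# Extinction coefficients at 420 nm quoted in the text (mM^-1 cm^-1).
EXTINCTION_MM = {"o-nitrophenol": 2.65, "p-nitrophenol": 6.7}


def product_rate_mM_per_s(dA_per_min, product, path_cm=1.0):
    """Beer-Lambert conversion of dA420/min into mM product per second.

    path_cm=1.0 is an assumed cuvette path length, not stated in the text.
    """
    eps = EXTINCTION_MM[product]
    return dA_per_min / (eps * path_cm) / 60.0


def kcat_per_s(vmax_mM_per_s, enzyme_mg_per_ml, monomer_kda):
    """Turnover number per active site, assuming one site per monomer.

    mg/ml divided by kDa gives the monomer concentration in mM.
    """
    enzyme_mM = enzyme_mg_per_ml / monomer_kda
    return vmax_mM_per_s / enzyme_mM
```

The enzyme concentration and monomer mass must be supplied by the user; the text does not state the numerical values that were used.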

Sequencing of the Evolved lacZ Gene. The 3.8-kb DNA 
fragment encoding the evolved β-galactosidase and its flanking 
regions was sequenced in both forward and reverse directions 
with 20 primers using an Applied Biosystems 391 DNA 
sequencer. 

RESULTS AND DISCUSSION 

Strategy for Evolving β-Galactosidase. The primary goal of 
the experiment was to determine if a substantial enhancement 
in the specificity and/or activity of a large model enzyme could 
be obtained by reiterative screening of libraries of a size 
(10,000 clones) that is routinely accessible by high throughput 
detection assays. No structural information was used in the 
design of the experiment, but the structure of β-galactosidase 
is useful for interpretation of results. 

Screening for Improved Fucosidase Activity. The 3.8-kb 
DNA fragment of p18LacZ containing the lacZ gene was 
shuffled as described previously (1-3), and the reassembled 
genes were digested with restriction enzymes (HindIII and 
BamHI) and ligated back into the vector p18-sfi-kan-sfi. The 
initial diversity was introduced into the native lacZ gene by 
random point mutagenesis, which occurs by shuffling of small 
fragments (1, 2). We previously showed that shuffling with 10- 
to 50-bp fragments resulted in a 0.7% rate of point mutation. 
Here we used fragments of 50 to 200 bp, which gives a 
much lower rate of point mutation and inactivated 
approximately 20% of the clones. X-Fuc was chosen as the 
indicator substrate for the plate assay because of the nondiffusible 
nature of the colored product and the high sensitivity 
(22). After each round of DNA shuffling, 10,000 kanamycin-resistant 
transformants, growing on plates supplemented with 
X-Fuc, were visually screened for enhanced blue color formation. 
About 2-5% of the transformant colonies in each round 
were more intensely blue than the 
bulk of the population. The 20-40 bluest colonies (0.2-0.4%) 
were picked at each round, individually verified to be more 
active than the pool from the previous round by plate assays, 
and then used as a pool for the source of DNA to initiate the 
next round of DNA shuffling. This number of colonies was 
chosen as a compromise between obtaining too little diversity 
(<10 colonies) and obtaining suboptimal selection pressure 
(>>100 colonies), which could limit the rate of improvement. 
In the seventh round of DNA shuffling some colonies developed 
a deep blue color after overnight growth (Fig. 2). One 
mutant from this seventh and final round of shuffling showed 
a 66-fold increase in fucosidase activity on 1 mM PNPF (Fig. 
3). 
Fig. 2. E. coli TB1 cells expressing the native β-galactosidase 
(white colonies, Upper Left) and the evolved fucosidase of the seventh 
round (blue colonies, Upper Right) after overnight growth on a 
Luria-Bertani plus kanamycin plate supplemented with 0.1 mM 
X-Fuc. (Lower) The results of plating a deliberate mixture of the two 
types of colonies. 
Fig. 3. Whole cell fucosidase activity on PNPF of the pool of 
colonies selected after each round of DNA shuffling. Rounds 1-7 are 
pools of colonies. Also shown are the activity of cells expressing the 
native β-galactosidase and cells expressing the evolved β-fucosidase, 
both measured as whole cell activity of single clones. The evolved 
fucosidase is the single-best colony selected after quantitative com- 
parison of the 24 best colonies from the pool of colonies obtained after 
shuffling round 7. For assay conditions, see Materials and Methods. 

Kinetics. After the final round of selection, (His)6 tags 
were added to the foreign N terminus of the native β-galactosidase 
and the evolved β-fucosidase enzymes. Both 
enzymes were purified, and the kinetic constants of each 
enzyme on the synthetic chromogenic substrates ONPG, 
ONPF, PNPG, and PNPF were determined (Table 1). For 
PNPF, the Km value of the evolved fucosidase is decreased 
by 20-fold from the Km of wild-type β-galactosidase on the 
same substrate. The kcat value is decreased about 2-fold. The 
kcat/Km values thus are increased about 10-fold in the evolved 
β-fucosidase. The activity of the wild-type enzyme on ONPF 
was very low and accurate Km and kcat values could not be 
obtained. By comparing the relative reaction rates of the 

Table 1. Kinetic constants for the native and evolved enzymes 

Substrate     Kinetic constant                Native           Evolved 
                                              galactosidase    fucosidase 

PNPG          kcat, s⁻¹                       268              30.9 
              Km, mM                          0.04             0.18 
              kcat/Km, mM⁻¹·s⁻¹               6,700            172 
PNPF          kcat, s⁻¹                       209              96.6 
              Km, mM                          31               1.5 
              kcat/Km, mM⁻¹·s⁻¹               6.7              64.4 
Specificity   (kcat/Km)PNPG/(kcat/Km)PNPF     1,000            2.7 
ONPG          kcat, s⁻¹                       765              14.5 
              Km, mM                          0.11             0.11 
              kcat/Km, mM⁻¹·s⁻¹               6,950            132 
ONPF          kcat, s⁻¹                       -                24.1 
              Km, mM                          -                0.55 
              kcat/Km, mM⁻¹·s⁻¹               (2)*             43.9 
Specificity   (kcat/Km)ONPG/(kcat/Km)ONPF     3,200            3.0 

The native galactosidase and the evolved fucosidase were purified, 
and the enzymes were assayed on four different substrates. 
*The kcat/Km value for the native galactosidase on ONPF was estimated 
to be about 2 mM⁻¹·s⁻¹ by measuring the hydrolysis rate 
relative to that of ONPG. 
Fig. 4. Nucleotide substitutions in the evolved fucosidase gene. The predicted amino acid changes are shown above the gene by the single-letter 
denotation, numbered according to the wild-type β-galactosidase sequence (17). Amino acid changes in the N-terminally fused peptide region 
(hatched area) are indicated by small vertical arrows. Mutations that do not result in amino acid changes are shown below the gene, numbered 
starting at the HindIII site, as in the parental plasmid pCH110. The gpt promoter is indicated by a thick arrow. The positions of the known active 
site residues of the wild-type β-galactosidase are indicated by black bars. 



wild-type enzyme on ONPF and ONPG (at the same enzyme 
and substrate concentrations), the kcat/Km for ONPF was 
estimated, assuming that the kcat/Km value is a second order 
rate constant. The kcat/Km values on ONPF were increased 
at least 20-fold in the evolved β-fucosidase. These increases 
in fucosidase activity were accompanied by decreases in 
galactosidase activity. For the substrates PNPG and ONPG, 
the kcat/Km is decreased 40-fold and 50-fold, respectively. 
These kinetic parameter changes suggest that the substrate 
binding pocket in the evolved β-fucosidase is different from 
that of the wild-type β-galactosidase. 

The native enzyme is highly specific for hydrolyzing galactosyl 
rather than fucosyl substrates. The kcat/Km values we 
determined for PNPG and PNPF differ by about 1,000-fold, 
and for ONPG and ONPF the values differ by more than 
3,000-fold (Table 1). The values we determined for the native 
β-galactosidase on ONPG, PNPG, and PNPF are in between 
the values reported previously (18, 20). The substrate specificity 
changed dramatically from the native β-galactosidase to 
the evolved β-fucosidase. For the evolved β-fucosidase the 
kcat/Km values for substrates PNPG and PNPF differ 2.7-fold 
and for substrates ONPG and ONPF the kcat/Km values differ 
3-fold. Therefore, the relative substrate specificity for fucosyl 
substrates, from the native to the evolved enzyme, is increased 
1,000-fold for the o-nitrophenyl substrates and 300-fold for the 
p-nitrophenyl substrates. The substrate specificity change was 
further supported by inhibition of the enzymatic activity by 
isopropyl β-D-thiogalactopyranoside, a β-galactosidase substrate 
analog and a competitive inhibitor of galactosyl substrates. 
The Ki values increased by one order of magnitude 
from the wild-type enzyme to the evolved β-fucosidase, from 
0.1 mM to 0.9 mM. The changes in Km values for the galactosyl 
substrates showed the same trend, because they either increased 
severalfold or stayed the same. These results imply that 
the substrate binding affinity of the evolved β-fucosidase is 
substantially increased for fucosyl substrates and decreased for 
galactosyl substrates, and hence the substrate binding pocket 
is likely to be significantly modified. 
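
The catalytic efficiencies and specificity ratios discussed above can be recomputed from the kcat and Km entries of Table 1. The following Python sketch does so; the dictionary layout and function names are illustrative choices of ours, and the native ONPF entry is omitted because only an estimated kcat/Km (about 2 mM⁻¹·s⁻¹) is available for it.

```python
# (kcat in s^-1, Km in mM) as listed in Table 1.
TABLE1 = {
    "native": {"PNPG": (268, 0.04), "PNPF": (209, 31), "ONPG": (765, 0.11)},
    "evolved": {"PNPG": (30.9, 0.18), "PNPF": (96.6, 1.5),
                "ONPG": (14.5, 0.11), "ONPF": (24.1, 0.55)},
}


def efficiency(enzyme, substrate):
    """Catalytic efficiency kcat/Km in mM^-1 s^-1."""
    kcat, km = TABLE1[enzyme][substrate]
    return kcat / km


def specificity(enzyme, galactoside, fucoside):
    """Ratio (kcat/Km) on the galactoside over (kcat/Km) on the fucoside."""
    return efficiency(enzyme, galactoside) / efficiency(enzyme, fucoside)


native_p = specificity("native", "PNPG", "PNPF")    # close to 1,000
evolved_p = specificity("evolved", "PNPG", "PNPF")  # close to 2.7
shift_p = native_p / evolved_p  # change in relative specificity
```

With the rounded table entries, the p-nitrophenyl specificity shift comes out at a few hundred fold, in line with the roughly 300-fold figure quoted in the text.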

DNA Sequence. The DNA sequence of the evolved fucosi- 
dase gene showed 13 nucleotide substitutions of which 11 were 
in the coding region. Six of the mutations are predicted to 
cause amino acid changes in the translated β-galactosidase 
sequence. Two additional mutations are predicted to cause 
amino acid changes in the N-terminal fusion peptide (Fig. 4). 
Fig. 5. Ribbon representation of the E. coli β-galactosidase subunit structure (14). The CNO atoms of the six amino acid mutations that 
conferred the fucosidase activity are shown with stick representation. Two mutations in the active site (Asn604 and Gln573) are shown in red. Two 
mutations in close proximity of the active site (Pro511 and Asp908) are shown in magenta. Two mutations far away from the active site and on 
the protein surface (Val9 and Gln135) are shown in green. The rest of the substrate binding and active site residues are shown in yellow. 
All 13 nucleotide changes were base transitions between 
purines and/or between pyrimidines, which usually are more 
frequent than transversions. 
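
The distinction between transitions and transversions used above can be expressed as a small check. The Python sketch below is an illustration only; the function name is ours, and the individual substitutions of the evolved gene (other than c23t, which is mentioned in the text) are not listed here.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def substitution_type(ref, alt):
    """Classify a single-base substitution as transition or transversion."""
    ref, alt = ref.upper(), alt.upper()
    bases = {ref, alt}
    if ref == alt or not (bases <= (PURINES | PYRIMIDINES)):
        raise ValueError("expected two different bases from A, C, G, T")
    # A transition stays within one chemical class (purine or pyrimidine).
    same_class = bases <= PURINES or bases <= PYRIMIDINES
    return "transition" if same_class else "transversion"
```

For example, the c23t change near the gpt promoter is a pyrimidine-to-pyrimidine substitution and is therefore classified as a transition.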

One major advantage of in vitro evolution of enzymes over 
the structural modeling approach is that only minimal in- 
formation is required for improving the desired phenotype. 
At each round of our experiment, only colonies with increased 
fucosidase activity were pooled and used for the next 
round of DNA shuffling and screening. Although both 
positive-acting mutations and neutral mutations may accu- 
mulate in the evolved fucosidase lacZ gene in each round, we 
expect that neutral mutations generally do not survive 
multiple rounds of shuffling and screening due to a back- 
crossing effect exerted by the consensus sequence (1, 2), 
combined with the lack of a selective advantage of the 
neutral mutations. Therefore, only mutations that somewhat 
contribute to the improved fucosidase activity are likely to 
accumulate in the evolved fucosidase. While we have not 
determined the effect of the separate mutations by site- 
specific mutational studies, we can predict what roles some 
of the mutations may play based on the three-dimensional 
structure of the parental β-galactosidase (ref. 14; Fig. 5) and 
the sophisticated kinetic models based on previous muta- 
tions and kinetic analysis of purified proteins (18, 20, 25, 26). 
Among the six amino acid changes in the β-galactosidase 
sequence, none appear directly involved in the inter-subunit 
contact. Three mutations (Pro511Ser, Gln573Arg, 
and Asn604Ser) are located in domain 3 (residues 
334-627) of the wild-type E. coli β-galactosidase (14). 
Domain 3 in the native protein contains most of the amino 
acids that form the substrate binding pocket (ref. 14; Fig. 5). 
Asn604 is one of the amino acids forming this substrate 
binding pocket in the protein (14), and this residue is 
conserved in several other known β-galactosidase sequences 
(25, 27-29), except the evolved galactosidase gene (ebgA) of 
E. coli (30). In our evolved fucosidase, Asn604 is replaced by 
Ser. This mutation presumably affects the enzyme's substrate 
specificity. All the other mutations found in the 
evolved β-fucosidase enzyme do not directly affect the active 
site and substrate binding pocket residues, and therefore 
they may have no effect or may only subtly change the 
conformation of the active site and substrate specificity. 
Gln573, substituted by Arg in the evolved fucosidase, is in 
close proximity to the substrate binding pocket (Fig. 5). The 
mutation Pro511Ser is also close to the active site and 
substrate binding pocket (Fig. 5). These two mutations are 
likely to affect the enzyme's active site. Asp908Asn is also 
close to the active site and may also affect the activity. 
Additional important catalytic residues of the active site, 
such as Glu461, Met502, Tyr503, and Glu537 (23, 26, 31), 
however, are unchanged in the evolved β-fucosidase, implying 
that the catalytic mechanism of the evolved enzyme 
remained the same. Therefore the evolved β-fucosidase 
seems to have only adjusted to fit the fucosyl substrate or its 
transition state better than the wild-type β-galactosidase 
does. In addition, one of the nucleotide mutations outside 
the structural gene (c23t) is very close to the gpt promoter 
region and could affect transcription (Fig. 4). This mutation, 
along with the two amino acid mutations in the N-terminal 
fusion peptide (Fig. 4), may influence the expression level of 
the protein. Indeed, we found that the evolved β-fucosidase 
enzyme was expressed at least 2- to 3-fold higher than the 
wild-type enzyme (data not shown). The mutations Val9Ile 
and Gln135Arg are far away from the active site and near the 
surface of the protein (Fig. 5), and may not have any 
significant effect on the enzymatic activity. The analysis of 
mutations obtained by molecular evolution of proteins pro- 
vides a new tool for studying structure-function relation- 
ships. However, the real utility of DNA shuffling is the 
ability to rapidly improve enzyme functions without the need 
to delineate the myriads of complex molecular mechanisms. 

There are several possible applications for the evolved 
β-fucosidase. One is as a novel reporter for β-D-fucosyl 
substrates, in addition to the widely used lacZ gene reporter. 
The advantage of using the novel enzyme is the low endogenous 
background of β-fucosidase activity because, unlike 
α-fucosidases, β-fucosidases are uncommon in nature. This 
well expressed fucosidase also could be used for the production 
of fucosyl adducts or for disaccharide synthesis by transglycosylation 
or reversal of the hydrolysis reactions, because analogous 
applications already have been demonstrated for the 
wild-type β-galactosidase (32, 33). Some of these applications 
may require further evolution of the fucosidase for the specific 
reaction. The present data suggest that it is reasonable to 
attempt to obtain such improvements by DNA shuffling and 
screening of libraries of modest size. 

We thank A. Crameri for technical assistance and Drs. F. H. Arnold, 
R. E. Huber, and P. Schatz for useful discussions. 
Complementary DNA Coding Click Beetle Luciferases 
Can Elicit Bioluminescence of Different Colors 

Keith V. Wood,* Y. Amy Lam, Howard H. Seliger, 
William D. McElroy 

Eleven complementary DNA (cDNA) clones were generated from messenger RNA 
isolated from abdominal light organs of the bioluminescent click beetle, Pyrophorus 
plagiophthalamus. When expressed in Escherichia coli, these clones can elicit bioluminescence 
that is readily visible. The clones code for luciferases of four types, distinguished 
by the colors of bioluminescence they catalyze: green (546 nanometers), yellow-green 
(560 nanometers), yellow (578 nanometers), and orange (593 nanometers). The 
amino acid sequences of the different luciferases are 95 to 99 percent identical with 
each other, but are only 48 percent identical with the sequence of firefly luciferase 
(Photinus pyralis). Because of the different colors, these clones may be useful in 
experiments in which multiple reporter genes are needed. 



NEARLY ALL OUR KNOWLEDGE OF 
beetle luciferases is derived from 
studies of a single species, the 
North American firefly Photinus pyralis. 
Comparative studies with other beetle luciferases 
have been hampered because of limited 
availability of the other species. Evolutionarily, 
beetle luciferases are unrelated to 
any of the other groups of luciferases that 
have been studied biochemically (1). Little is 
known about the luciferases from other beetles 
except that they all catalyze the production 
of various colors of light through the 
oxidative decarboxylation of beetle luciferin 
(2). Since the substrates of the luminescent 
reaction are the same in all these beetles, the 
different colors must be due to differences in 
the structure of the enzymes (3). 

Recently we cloned a cDNA that codes 
for the luciferase of P. pyralis, and have 
shown that it can be used to express bioluminescence 
in Escherichia coli. We report here 
the cloning of cDNAs that code for several 
new luciferases from a bioluminescent click 
beetle, Pyrophorus plagiophthalamus. This beetle 
is unusual because it can emit bioluminescence 
of a wide range of colors from a 
single species. The expression products in E. 
coli of the cDNAs derived from this beetle 
are able to produce green, yellow-green, 
yellow, and orange light. As determined 
from the nucleotide sequences of the clones, 
the amino acid sequences of these click 
beetle luciferases are highly conserved 
among one another, but diverge from the 
sequence of the firefly luciferase. Taxonomy 
indicates that the click beetle luciferases 
probably are the most evolutionarily distant 
of the beetle luciferases from the firefly 
luciferase (4). This distance is reflected by 
differences in their chemical properties. 

Pyrophorus plagiophthalamus is a large beetle 
with two sets of light organs. One set, on 
the dorsal surface of the head, emits light 
that is greenish but the exact color varies 
between individual beetles of the species, 
ranging from green (548 nm) to yellow-green 
(565 nm). The other set, at the anterior 
of the abdomen, generally emits light of a 
longer wavelength than the head organs but 
also varies between individuals ranging from 
green (547 nm) to orange (594 nm) (5). We 
converted mRNA isolated from the abdominal 
light organ of 60 beetles to cDNA and 
inserted this into a specialized lambda cloning 
vector, Lambda ZAP (6). The ability to 
convert this modified lambda vector into a 
bacterial expression plasmid (Bluescript) 
through an in vivo process allowed us to 
screen the cDNA library by two methods 
(7). In the phage form of the library, we 
screened with antibody to firefly luciferase 
that cross-reacts with the click beetle luciferases 
(8) and isolated four full-length clones 
that expressed bioluminescence in E. coli. A 
portion of the cDNA library was converted 
into the plasmid form, and we screened this 
for bioluminescence in the bacterial colonies. 
Bioluminescence can be initiated in 
colonies of E. coli expressing luciferase by 
adding luciferin to the media (9). Seven 
more cDNA clones were isolated by this 
method. It was determined visually that of 
the eleven clones, one produced green light, 
one produced yellow-green light, six produced 
yellow light, and three produced orange 
light. 

Immunoblot analysis confirmed the production 
of full-length click beetle luciferase 
in E. coli. Despite some of these clones being 
detected with antibody to firefly luciferase 
during the library screening of plaques, we 
could not detect the gene products in blots 
made directly with E. coli lysates. The 
expression of bioluminescence was improved 
by transferring the cDNA clones 
into a plasmid vector incorporating the tac 
promoter (10). A lysate from E. coli expressing 
the green-emitting luciferase from this 
vector was partially purified. After gel electrophoresis 
and blotting, a single antigenic 
band was revealed that comigrated with the 
native click beetle luciferase. Subsequently 
one cDNA clone from each of the four 
color-emitting groups was sequenced. An 
open reading frame was revealed in each that 
could potentially code a protein, the sequence 
of which correlated with the entire 
length of the sequence for firefly luciferase. 
Thus the complete protein coding regions of 
the click beetle luciferases were apparently 
contained within their cDNA clones. 

Expression of bioluminescence from the 
tac vector yielded sufficient intensity, upon 
addition of luciferin to the media, to allow 
measurement of the spectral distribution 
from intact cells (Fig. 1). This confirmed the 
visual assignment of the 11 cDNA clones 
into four color groups. For each of the four 
colors, the bioluminescence spectrum is a 
single peak qualitatively similar to the spectra 
of the native click beetle luciferases (J). 
The range of colors from the clones is 
representative of the full range measured 
from the abdominal light organs of living 
beetles. However, there are colors emitted 
by the beetles, within the extremes of this 
range, that do not correspond to any of the 
clones (5). Thus other luciferase genes may 
not have been isolated. Spectra of the luciferases 
were also measured from partially 
purified preparations obtained from lysates 
of the E. coli expressing the cDNA clones. 




Fig. 1. Bioluminescence from colonies of E. coli 
expressing the click beetle luciferases. Four streaks 
of E. coli, each consisting of hundreds of colonies, 
show the four colors of bioluminescence emitted 
by the different luciferases. The colonies were 
grown on nitrocellulose filters layered on top of 
nutrient agar. To initiate the bioluminescent reaction, 
the filters were removed from the agar and 
soaked with 1 mM luciferin in 100 mM sodium 
citrate, pH 5.0. The photograph was produced 
from a 2-5 contact exposure of the colonies onto 
Ektachrome 64. 
Fig. 2. Spectra of the click beetle luciferases, from 
intact E. coli immersed in luciferin, show four 
overlapping peaks of nearly even spacing. The 
maximum intensity of each spectrum has been 
normalized. The spectra were measured on a 
Fastie-Ebert-type grating spectrometer (2) and 
were corrected for the spectral sensitivity of the 
photomultiplier tube and for time-dependent 
variation in the intensity of the luminescent reactions. 
Wavelength positions were calibrated 
against the spectral lines of a mercury vapor lamp. 
Virtually identical spectra were obtained from 
lysates of these E. coli between pH 6 and 8. The lysates were prepared from cells grown to middle or late 
log-phase growth at 37°C, and then for 2 hours at 30°C with isopropylthiogalactoside (IPTG) added to 
1 mM. The cells were washed and resuspended in approximately 1/150 volume of the culture with 100 
mM potassium phosphate, pH 7.0, 2 mM EDTA, and 35% of saturation (NH4)2SO4. After lysis by 
sonication and removal of the debris by centrifugation, (NH4)2SO4 was added to 53% of saturation and 
the precipitate was dissolved in 1/15 volume of the lysate with 100 mM potassium phosphate, pH 7.0, 2 
mM EDTA, and 50% glycerol. The spectra were measured from 10 µl of this solution diluted 100-fold 
with 50 mM 2-(N-morpholino)ethanesulfonic acid, 50 mM 2-(N-morpholino)propanesulfonic acid 
(MOPS), 50 mM tricine, 5 mM MgSO4, 1 mM EDTA, 0.1 mM luciferin, 1.5 mM ATP, 1 mM NaF, 
0.2 mg of bovine serum albumin per milliliter, and 10% glycerol. The spectra were measured at pH 6.0, 
7.0, and 8.0. 
Fig. 3. Alignment of the amino acid sequences of the click beetle and the firefly luciferases is shown to 
emphasize sequence differences. The sequence information is derived from the open reading frames of 
the corresponding cDNA clones. The identity of each luciferase sequence is indicated at the right of 
each line by a two letter code: FF, firefly; GR, green-emitting click beetle; YG, yellow-green-emitting 
click beetle; YE, yellow-emitting click beetle; and OR, orange-emitting click beetle. Only the sequence 
for the green-emitting click beetle luciferase is shown in entirety. Gaps in the alignment of this sequence 
are indicated by hyphens. Other luciferase sequences have letter designations only at sites where they 
differ from the green-emitting luciferase; where the sequences are the same there is a period. Numbers 
on the right indicate the position of the amino acid at the end of each line. Abbreviations for the amino 
acid residues are A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met; 
N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr. 



Between pH 6 and 7, the spectra of these 
preparations were indistinguishable from 
those of intact cells. At pH 8 there was a 
slight broadening of the spectra for the 
green- and yellow-emitting luciferases. The 
firefly luciferase shows a large spectral shift 
between pH 6 and 8. At pH 8 its spectral 
maximum is at 560 nm, which shifts to 615 
nm (red) at pH 6 with a decrease in the 
quantum yield (11). 

The sequences of the different click beetle 
luciferases are highly similar (Fig. 2). The 
open reading frame of each of the sequenced 
cDNA clones potentially codes a 543-residue 
polypeptide. Comparisons of the derived 
amino acid sequences show a 95 to 
99% identity between the different color-emitting 
luciferases. Thus the number of 
amino acids that are responsible for the 
differences in the color is small. Because 
variation in color results directly from differences 
in the primary structures of the luciferases, 
specialized posttranslational modifications 
or unusual microenvironmental effects 
are not necessary to account for the color 
variation in the living beetles. 

Comparison of the sequences of click 
beetle luciferases with that of firefly luciferase 
shows a low similarity. Alignment of 
their deduced amino acid sequences reveals 
that the various click beetle and the firefly 
luciferases are 48% identical (Fig. 3). Six 
gaps in the alignment of one to two amino 
acids in length account for most of a seven-amino 
acid difference in the lengths of the 
open reading frames between the firefly and 
click beetle luciferases. No regions in the 
alignment show especially high sequence 
similarity, thus giving little indication that 
particular regions have been conserved because 
of catalytic or structural constraints on 
the enzymes. An exception to this is in the 
last three amino acids which, for the firefly 
luciferase, have been shown to be necessary 
for translocation into peroxisomes (12). 
Given the close functional similarity of these 
enzymes, it is almost certain that the click 
beetle luciferases are also located in peroxisomes. 
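
Percent identity figures such as those quoted above are computed from aligned sequences. The Python sketch below shows one common way of doing this. It is an assumption-laden illustration: the text does not say how gapped positions were counted, so skipping them here is our choice, and the short sequences in the usage note are invented rather than taken from the luciferases.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identical residues over the ungapped columns of an alignment.

    Assumption: columns with a gap in either sequence are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared
```

For instance, the invented pair "MKV-LA" and "MRVQLA" has five ungapped columns, four of which match.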

Firefly luciferase has historically been 
used as a bioluminescent reporter of chemical 
events associated with adenosine triphosphate 
(ATP) metabolism (13). With the 
cloning of its cDNA, this luciferase has also 
recently found application as an effective 
reporter of genetic events (14, 15). Its principal 
advantages are that (i) the initial polypeptide 
derived from the mRNA requires 
no posttranslational modifications for enzymatic 
activity; (ii) the luminescent reaction 
can be measured with high sensitivity; (iii) 
the assay of the gene product is rapid and 
does not use substrates requiring special 
precautions (such as radioactive isotopes or 
chemically unstable compounds); and (iv) 
gene expression may be detected without 
disruption of living tissue. Compared with 
the conventionally used assay of chloramphenicol 
acetyltransferase (CAT) for gene 
activity, firefly luciferase is assayed in minutes 
as opposed to hours, and is 100 to 1000 
times more sensitive (15). 

The cDNAs coding for the click beetle 
luciferases also have these features, and, as 
they can be distinguished by color, may be 
useful in situations where multiple reporters 
are desirable. Expression in exogenous hosts 
should differ little between these luciferases 
because of their sequence similarity. Also, 
since the colors do not shift near physiological 
pH, the different luciferases can be distinguished 
in vivo as well as in vitro. Thus 
the click beetle luciferases may provide a 
dual reporter system that can allow two 
different promoters to be monitored within 
a single host, or for different populations of 
The class A β-lactamase PER-1, which displays 26% 
identity with the TEM-type extended-spectrum 
β-lactamases (ESBLs), is characterized by a substrate 
profile similar to that conferred by these latter enzymes. 
The role of residues Ala164, His170, Ala171, Asn179, 
Arg220, Thr237 and Lys242, found in PER-1, was assessed 
by site-directed mutagenesis. Replacement of Ala164 by 
Arg yielded an enzyme with no detectable β-lactamase 
activity. Two other mutants, N179D and A164R+N179D, 
were also inactive. Conversely, a mutant with the A171E 
substitution displayed a substrate profile very similar to 
that of the wild-type enzyme. Moreover, the replacement 
of Ala171 by Glu in the A164R enzyme yielded a double 
mutant which was active, suggesting that Glu171 could 
compensate for the deleterious effect of Arg164 in the 
A164R+A171E enzyme. A specific increase in kcat for 
cefotaxime was observed with H170N, whereas R220L and 
T237A displayed a specific decrease in activity towards the 
same drug and a general increase in affinity towards 
cephalosporins. Finally, the K242E mutant displayed a 
kinetic behaviour very similar to that of PER-1. Based on 
three-dimensional models generated by homology modelling 
and molecular dynamics, these results suggest novel 
structure-activity relationships in PER-1, when compared 
with those previously described for the TEM-type ESBLs. 
Keywords: β-lactamase/expanded-spectrum cephalosporins/ 
homology modelling/PER-1/serine enzyme 



Introduction 

PER-1 is a class A β-lactamase characterized by a high 
catalytic activity against expanded-spectrum cephalosporins 
[e.g. cefotaxime (CTX) and ceftazidime (CAZ)] and monobactams 
[e.g. aztreonam (AZT)]. The enzyme, which was first 
identified in Pseudomonas aeruginosa (Nordmann et al., 1993), 
displays a kinetic behaviour very similar to that of various 
extended-spectrum β-lactamases (ESBLs) belonging to the 
TEM and SHV families (Jacoby and Medeiros, 1991). In 
contrast, PER-1 shares a relatively low amino acid identity 
with these latter enzymes, e.g. 26% with TEM-3 (Nordmann 
and Naas, 1994). 

Recently, we have undertaken biochemical studies in order 
to elucidate the molecular basis of PER-1 activity against 
expanded-spectrum cephalosporins (Bouthors et al., 1998). 
Molecular modelling and site-directed mutagenesis were used 
to investigate in this enzyme the role played by the amino 
acid residues corresponding to those found at positions 104, 
164, 238 and 240 in the TEM-type ESBLs. In brief, two 
residues, Asn104 and Ala164, were shown to be important for 
the activity of PER-1. Asn104, which corresponds to the lysine 
residue found at the same position in various TEM-type 
ESBLs, would be connected to the key catalytic residue 
Glu166 via a hydrogen bond network, whereas Ala164, which 
corresponds to a highly conserved arginine in the Ω-loop of 
class A β-lactamases described so far (Ambler et al., 1991), 
could play an important structural role. By contrast, modification 
of the serine residue found at position 238 in PER-1, 
which is an amino acid found specifically in a large number 
of TEM-type ESBLs (Bush and Jacoby, 1997), resulted in no 
significant modification of the activity of PER-1 against 
expanded-spectrum cephalosporins. Similarly, Gly240 in PER-1 
was shown to have no essential role in the substrate profile 
of the enzyme. Finally, the catalytic residue Glu166, found in 
all class A β-lactamases, appeared to be essential to the 
β-lactamase activity of PER-1. However, an unexpected 
residual activity against CAZ and AZT was observed for a 
mutant in which Glu166 was replaced by Ala, suggesting that 
other residues in PER-1 could contribute to the high activity 
of the enzyme against expanded-spectrum cephalosporins. 

In this work, we investigated other amino acid residues 
found either within or in the vicinity of the PER-1 active site: 
Ala164, His170, Ala171 and Asn179, which are located within 
the putative Ω-loop of PER-1; Thr237, which is found at the 
end of the β3 strand and is likely to participate in the formation 
of the oxyanion hole (Herzberg and Moult, 1987; Strynadka 
et al., 1992); Arg220, which is located in a position similar to 
that of Arg244 found on strand β4 in TEM-1 and which could 
contribute to the stabilization of the oxyanion pocket via 
hydrogen bonding interactions with strand β3 (Moews et al., 
1990; Jacob-Dubuisson et al., 1991); and Lys242, found in the 
loop connecting strands β3 and β4. All these residues were 
modified by site-directed mutagenesis and the kinetic properties 
of the resulting mutants were characterized. By using homology 
modelling and molecular dynamics simulations, we have 
attempted to interpret at the structural level the kinetic data 
obtained for some of the Ω-loop mutants. 

Materials and methods 

Chemicals 

Antibiotic powders were provided by the following manufac- 
turers: penicillin G, Laboratoires de Therapeutique Moderne 
(Suresnes, France); ampicillin, cephalothin and kanamycin, 
Sigma Chemical (St Louis, MO, USA); cefotaxime, Labora- 
toires Roussel (Paris, France); nitrocefin and ceftazidime, 
Glaxo (Paris, France); and aztreonam, Bristol Myers Squibb 
(Paris-La Defense, France). 
Table I. Nucleotide sequences of the oligonucleotides used in site-directed 
mutagenesis 

Amino acid modification   Oligonucleotide sequence(a) 

Ala164 → Arg              5'-CATCTGCGCTTCATTTCGGACCACAGCGGTCTC-3' 
His170 → Asn              5'-CACCTGATCATCGGCGTTCATCTGCGCTTCATT-3' 
Ala171 → Glu              5'-CTGCACCTGATCATCTTCGTGCATCTGCGCTTC-3' 
Asn179 → Asp              5'-TTTCATCGAGGTCCAGTCTTGATACTGCACCTG-3' 
Arg220 → Leu              5'-TAACAAACCTTTTAACAGCTCTGGTCCTGTGGT-3' 
Thr237 → Ala              5'-GGCTTTGATACCCGAAGCACCAGTTTTATGTG-3' 
Lys242 → Glu              5'-CGCAGTTTTTCCGGCTTCGATACCCGAAGTACC-3' 

(a) Specific base changes are underlined. 



The restriction enzymes used in this study were obtained 
from Boehringer Mannheim (Meylan, France) and T4 DNA 
ligase from Promega (Madison, WI, USA). [33P]dCTP was 
purchased from Isotopchim (Ganagobie, France). 

Escherichia coli strains, plasmids and growth conditions 

E.coli CJ236 (Kunkel et al., 1987) and MV1190 (McClary 
et al., 1989) were used as hosts for phages in site-directed 
mutagenesis experiments. E.coli JM109 (Promega) was used 
for DNA cloning experiments and for expression of blaPER-1 
and the corresponding mutant genes. 

The recombinant plasmid pRAZ1, encoding PER-1, has 
been described by Nordmann et al. (1993). Bacteriophage 
M13mp19 (Messing, 1983) was used as a vector in site-directed 
mutagenesis experiments. Plasmid pK19 (kanamycin 
resistance) (Pridmore, 1987) was used in cloning experiments. 

E.coli MV1190 and JM109 were grown at 37°C in Luria-Bertani 
(LB) (Difco, Detroit, MI, USA) and brain-heart infusion 
(BHI) (Difco), respectively. Solid media were obtained by the 
addition of 2% Bacto-Agar (Difco). Kanamycin (25 µg/ml) 
and ampicillin (100 µg/ml) were added when necessary. 
Competent E.coli cells were prepared and transformed as 
described by Chung et al. (1989). 

Nucleic acid techniques 

Plasmid DNA was purified using either the alkaline lysis method for 
mini-preparations (Birnboim and Doly, 1979) or the Qiagen 
plasmid kit for maxi-preparations (Qiagen, Hilden, Germany). 
Isolation of single-stranded DNA and other standard DNA 
manipulations were carried out according to Sambrook et al. 
(1989). Double- and single-stranded DNA sequencing were 
carried out by the dideoxynucleotide chain termination method 
(Sanger et al., 1977) using the T7 Sequencing kit (Pharmacia 
Biotech, Saint Quentin en Yvelines, France). 

Site-directed mutagenesis 

Site-directed mutagenesis experiments were performed as 
described previously (Bouthors et al., 1998). In brief, the 
blaPER-1 gene was excised from pRAZ1 (1.3 kb) and introduced 
into M13mp19 RF. Site-directed mutagenesis was performed 
using the uracil template procedure of Kunkel et al. (1987). 
The sequences of the synthetic phosphorylated oligonucleotides 
(Eurogentec, Liege, Belgium) used to introduce the different 
mutations in the blaPER-1 gene are listed in Table I. After 
mutagenesis, each mutant gene was cloned into plasmid pK19 
and the recombinant plasmids thus obtained were introduced 
by transformation into E.coli JM109. The mutant genes were 
all sequenced in their entirety and on both strands. 
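
As an illustration of how a mutagenic oligonucleotide from Table I can be checked, the sketch below takes the Ala164 → Arg oligonucleotide, computes its reverse complement and translates it. The reading frame (first base of the reverse complement) is an assumption of this sketch and is not stated in the text; under that assumption the central codon is CGA (Arg), followed two codons later by a Glu codon, which would be consistent with positions 164 and 166. The helper names are hypothetical.

```python
# Standard genetic code, with codons enumerated in the base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(complement[base] for base in reversed(seq))


def translate(seq):
    """Translate complete codons starting at the first base (assumed frame)."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# Ala164 -> Arg oligonucleotide as listed in Table I (5' to 3').
A164R_OLIGO = "CATCTGCGCTTCATTTCGGACCACAGCGGTCTC"

sense = reverse_complement(A164R_OLIGO)
peptide = translate(sense)
print(sense)    # GAGACCGCTGTGGTCCGAAATGAAGCGCAGATG
print(peptide)  # ETAVVRNEAQM
```

The same two functions can be applied to the other oligonucleotides of Table I, although the appropriate frame would have to be checked for each of them.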



Expression and purification of the wild-type and mutant 
β-lactamases 

The wild-type and mutant enzymes were purified from 1-litre 
cultures by a two-step procedure based on an anion-exchange 
column followed by a gel filtration, as described previously 
(Bouthors et al., 1998). The mutant β-lactamases displaying 
significant activity were detected using the chromogenic 
cephalosporin nitrocefin (O'Callaghan et al., 1972), while the 
almost inactive enzymes A164R, N179D and A164R+N179D 
were identified by electrophoresis on 12% SDS-polyacrylamide 
gels (Laemmli, 1970), with the wild-type PER-1 β-lactamase 
as a molecular mass reference. In order to avoid 
concerns about enzyme stability, kinetic studies were performed 
shortly after purification. The purity of the different enzymes 
was assessed by Coomassie Blue staining of SDS-polyacrylamide 
gels after electrophoresis. Protein concentration was 
determined by measuring the absorbance at 280 nm (Lorber 
and Giege, 1992) with an ε value of 34 850 M⁻¹·cm⁻¹ (Bouthors 
et al., 1998). For mutants exhibiting more than one protein 
band on SDS-PAGE analysis, the intensity of the β-lactamase 
band was measured with a computerized densitometer 
(Densylab, Bioprobe) and the enzyme concentration was determined 
with reference to a standard BSA scale analyzed in the 
same conditions. 
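
The conversion from absorbance at 280 nm to protein concentration follows the Beer-Lambert law, c = A / (ε · l). A minimal sketch is given below; the absorbance reading of 0.35 and the 1 cm path length are hypothetical values chosen for illustration, and only the ε value is taken from the text.

```python
EPSILON_280 = 34850.0  # M^-1 cm^-1, extinction coefficient quoted in the text


def protein_concentration_molar(a280, path_cm=1.0, epsilon=EPSILON_280):
    """Beer-Lambert law: concentration (M) = absorbance / (epsilon * path length)."""
    return a280 / (epsilon * path_cm)


# Hypothetical absorbance reading, for illustration only.
conc = protein_concentration_molar(0.35)
print(f"{conc * 1e6:.2f} uM")  # 10.04 uM
```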

Isoelectric focusing 

Isoelectric focusing was performed with an LKB Multiphor 
apparatus with pH 3.5-9.5 PAG plates (Pharmacia Biotech). 
Gels were focused at 30 W for 90 min at 10°C. β-Lactamase 
activity was revealed by staining with the nitrocefin assay. 

Determination of the kinetic parameters of the wild-type and 
mutant enzymes 

Kinetic assays were performed spectrophotometrically in 0.1 M 
sodium phosphate buffer (pH 7.0) at 30°C on a Uvikon 
940 spectrophotometer. The wavelengths and the extinction 
coefficients used were as follows: penicillin G, 232 nm, 
Δε = -1100 M⁻¹·cm⁻¹; cephalothin, 262 nm, Δε = 
-7960 M⁻¹·cm⁻¹; cefotaxime, 260 nm, Δε = -6710 M⁻¹·cm⁻¹; 
ceftazidime, 260 nm, Δε = -8660 M⁻¹·cm⁻¹; and aztreonam, 
318 nm, Δε = -650 M⁻¹·cm⁻¹. For each antibiotic, initial rates 
were measured at six different substrate concentrations. Kinetic 
parameters were determined by fitting the Michaelis-Menten 
equation to the experimental data using the regression analysis 
program LEONORA written by Cornish-Bowden (1995). The 
values for kcat and Km were estimated using a non-linear 
least-squares regression method with dynamic weights (Cornish-Bowden, 
1995). 
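
The fitting step can be sketched as follows. This is not the LEONORA procedure: it is a plain, unweighted Gauss-Newton fit of the Michaelis-Menten equation, started from a Hanes-Woolf linear estimate, and it does not reproduce the dynamic weighting mentioned above. The six substrate concentrations are hypothetical, and the rates are noise-free synthetic values generated from the wild-type cefotaxime parameters of Table II (kcat = 41 s⁻¹, Km = 441 µM), expressed per enzyme molecule so that the fitted maximum rate equals kcat.

```python
def michaelis_menten(s, vmax, km):
    """Michaelis-Menten rate law: v = Vmax * S / (Km + S)."""
    return vmax * s / (km + s)


def fit_michaelis_menten(substrate, rates, iterations=50):
    """Unweighted non-linear least-squares fit (Gauss-Newton) of Vmax and Km."""
    n = len(substrate)
    # Initial guess from the Hanes-Woolf plot: S/v = S/Vmax + Km/Vmax.
    y = [s / v for s, v in zip(substrate, rates)]
    mean_x = sum(substrate) / n
    mean_y = sum(y) / n
    sxy = sum((x - mean_x) * (yy - mean_y) for x, yy in zip(substrate, y))
    sxx = sum((x - mean_x) ** 2 for x in substrate)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / slope
    km = intercept * vmax

    for _ in range(iterations):
        j11 = j12 = j22 = g1 = g2 = 0.0
        for s, v in zip(substrate, rates):
            residual = v - michaelis_menten(s, vmax, km)
            d_vmax = s / (km + s)               # partial derivative w.r.t. Vmax
            d_km = -vmax * s / (km + s) ** 2    # partial derivative w.r.t. Km
            j11 += d_vmax * d_vmax
            j12 += d_vmax * d_km
            j22 += d_km * d_km
            g1 += d_vmax * residual
            g2 += d_km * residual
        det = j11 * j22 - j12 * j12
        step_vmax = (j22 * g1 - j12 * g2) / det
        step_km = (j11 * g2 - j12 * g1) / det
        vmax += step_vmax
        km += step_km
        if abs(step_vmax) < 1e-12 and abs(step_km) < 1e-12:
            break
    return vmax, km


# Hypothetical substrate concentrations (uM) and noise-free synthetic rates (s^-1).
SUBSTRATE_UM = [50.0, 100.0, 200.0, 400.0, 800.0, 1600.0]
RATES = [michaelis_menten(s, 41.0, 441.0) for s in SUBSTRATE_UM]

kcat_fit, km_fit = fit_michaelis_menten(SUBSTRATE_UM, RATES)
print(f"kcat = {kcat_fit:.1f} s^-1, Km = {km_fit:.1f} uM")
```

With real, noisy data the choice of weights affects both the estimates and their standard errors, which is why a dedicated program was used in the work described here.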

Molecular modelling 

The refined theoretical three-dimensional structures of PER-1 
and the mutant enzymes were constructed by homology modelling 
using the computer program Swiss-Model (Peitsch, 1996), 
as described previously (Bouthors et al., 1998). The models 
were then subjected to 5000 steps of energy minimization 
using the Powell minimizer of X-PLOR (Brunger, 1988). The 
Ω-loop region in the resulting minimized structures was 
subjected to molecular dynamics simulations in vacuum. The 
molecular dynamics were initially performed on the 150-190 
region of PER-1 containing the Ω-loop (residues 161-179) 
and the two α-helix regions enclosing the loop (residues 150-160 
and 180-190, respectively). The results obtained from 
this large segment indicated that the two α-helix regions 
enclosing the loop were very stable (r.m.s.d. = 0.2 Å). 
Therefore, the molecular dynamics simulations were subsequently 
confined to the region encompassing residues 160-180, 
using the following simulation procedure: the target 
temperature started at 0 K to reach the final temperature, 
300 K, within 18 ps. After 30 ps of stabilization at 300 K, the 
molecular dynamics phase lasted 100 ps at 300 K, with a time 
step of 0.001 ps and a dielectric constant (ε) of 4.0. The 
conformations trapped at 300 K were visualized by using the 
VMD (Visual Molecular Dynamics) program (Humphrey et al., 
1996). The mean of the conformations, which was subjected 
to 500 steps of energy minimization, was used in structure 
comparison. 
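
For orientation, the simulation schedule described above can be converted into numbers of integration steps. The sketch below is simple arithmetic on the durations and the time step given in the text; it is not an X-PLOR input file, and the phase labels are ours.

```python
TIME_STEP_PS = 0.001  # integration time step quoted in the text

# Phase durations in picoseconds, as described in the text.
PHASES_PS = {
    "heating (0 K to 300 K)": 18,
    "stabilization at 300 K": 30,
    "production at 300 K": 100,
}


def steps_for(duration_ps, time_step_ps=TIME_STEP_PS):
    """Number of integration steps needed to cover a duration."""
    return round(duration_ps / time_step_ps)


schedule = {name: steps_for(duration) for name, duration in PHASES_PS.items()}
total_steps = sum(schedule.values())

for name, steps in schedule.items():
    print(f"{name}: {steps} steps")
print(f"total: {total_steps} steps")  # total: 148000 steps
```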

Results 

Production and purification of the mutant enzymes 
Site-directed mutagenesis was used to replace the amino acid 
residues located at positions 164, 170, 171, 179 and 237 in 
PER-1 by those found at the same positions in TEM-1, which has no 
significant activity against expanded-spectrum cephalosporins. 
Thus, the single mutants A164R, H170N, A171E, N179D and 
T237A, and also two double mutants, A164R+A171E and 
A164R+N179D, were constructed. In addition, Lys242 in 
PER-1, which could be the counterpart of the lysine residue 
found at position 240 in various TEM-ESBLs (Bush and 
Jacoby, 1997), was replaced by a glutamic acid residue, as 
found in TEM-1 at position 240 (Sutcliffe, 1978). Finally, 
Arg220 in PER-1, which is equivalent to Arg244 located on 
the β4 strand in TEM-1, was replaced by a leucine. SDS-PAGE 
analysis of crude extracts showed that all the mutant 
β-lactamases, except three, were expressed in normal amounts 
(data not shown). Indeed, when compared with the wild-type 
enzyme, the A164R, N179D and A164R+N179D mutants 
were expressed at very low levels and the various purification 
attempts carried out in order to determine the kinetic features 
of the three mutants remained unsuccessful. Production and 
purification of the other enzymes, which were all active, were 
performed as described previously (Bouthors et al., 1998). 
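
The mutant names used throughout follow the usual one-letter convention (wild-type residue, position, new residue), with double mutants joined by "+". A small sketch of how such names can be expanded is given below; the function names are hypothetical and the three-letter table covers only the residues that occur in this study.

```python
import re

# Three-letter codes for the residues that appear in the mutants of this study.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "E": "Glu",
    "H": "His", "K": "Lys", "L": "Leu", "T": "Thr",
}
SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutant(name):
    """Split a name such as 'A164R+A171E' into (wild type, position, new) tuples."""
    substitutions = []
    for part in name.split("+"):
        match = SUBSTITUTION.match(part.strip())
        if match is None:
            raise ValueError(f"unrecognized substitution: {part!r}")
        wild_type, position, new = match.groups()
        substitutions.append((wild_type, int(position), new))
    return substitutions


def describe(name):
    """Write a mutant name in the three-letter form used in Table I."""
    return ", ".join(
        f"{THREE_LETTER[wt]}{pos} -> {THREE_LETTER[new]}"
        for wt, pos, new in parse_mutant(name)
    )


print(describe("A164R+N179D"))  # Ala164 -> Arg, Asn179 -> Asp
print(describe("K242E"))        # Lys242 -> Glu
```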

Isoelectric focusing, carried out on the purified enzymes, 
indicated that three mutants, A164R+A171E, H170N and 
T237A, displayed pI values indistinguishable from that of the 
wild-type protein (pI = 5.4). Conversely, the pI values found 
for the R220L, A171E and K242E mutants were shifted 
towards more acidic values (pI = 5.2, 5.0 and 4.9, respectively) 
(data not shown). Finally, the isoelectric points of the three 
mutants A164R, N179D and A164R+N179D could not be 
determined since the corresponding crude extracts contained 
no significant β-lactamase activity. 
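
The direction of these pI shifts can be compared with a simple count of formal side-chain charges. The sketch below is only a rough bookkeeping exercise under stated assumptions: Lys and Arg are counted as +1, Asp and Glu as -1, and every other residue (including histidine) as neutral. It makes no attempt to predict actual pI values.

```python
# Formal side-chain charges; histidine is treated as neutral here, which is a
# simplifying assumption of this sketch.
SIDE_CHAIN_CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}


def net_charge_change(mutant):
    """Sum over substitutions of (charge of new residue - charge of old residue)."""
    change = 0
    for substitution in mutant.split("+"):
        wild_type, new = substitution[0], substitution[-1]
        change += SIDE_CHAIN_CHARGE.get(new, 0) - SIDE_CHAIN_CHARGE.get(wild_type, 0)
    return change


for name in ["A164R+A171E", "H170N", "T237A", "R220L", "A171E", "K242E"]:
    print(name, net_charge_change(name))
```

Under these assumptions the three mutants with unchanged pI show no net change in formal charge, whereas R220L, A171E and K242E each lose one or two positive charge units, in line with the acidic shifts reported above.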

Kinetic analysis 

The steady-state kinetic parameters kcat and Km for penicillin 
G, cephalothin, cefotaxime (CTX), ceftazidime (CAZ) and 
aztreonam (AZT) were determined from the purified active 
β-lactamases. The values obtained are shown in Table II. 

Wild-type β-lactamase. As expected, the values of the rate 
constants obtained for the wild-type PER-1 β-lactamase were 
similar to those reported previously (Bouthors et al., 1998). 
The enzyme was characterized by a high apparent affinity for 
penicillin G, cephalothin and AZT (Km values ranging from 
23 to 147 µM), but a poor apparent affinity for the expanded-spectrum 
cephalosporins CTX and CAZ (441 and 4150 µM, 
respectively). Conversely, the kcat values for the last two drugs 
(41 and 109 s⁻¹, respectively) were markedly higher than those 
found for the other β-lactam antibiotics (kcat values ranging 
between 8 and 11 s⁻¹) (Table II). 

Mutants of residues located in the Ω-loop. Four positions were 
investigated at the level of the Ω-loop region of the protein: 
164, 170, 171 and 179. Four single mutants (A164R, H170N, 
A171E and N179D) and two double mutants (A164R+A171E 
and A164R+N179D) were analyzed, as described below. 

Mutants A164R, A171E and A164R+A171E. As observed 
previously (Bouthors et al., 1998), no significant enzymatic 
activity was detected with the A164R mutant. By contrast, the 
substitution of the alanine residue found at position 171 in 
PER-1 by a glutamate resulted in no significant modifications 
of the kcat and Km values, when compared with the wild-type 
enzyme (Table II). Similarly, the double mutant A164R+A171E 
yielded an active enzyme which showed kcat and Km 
values similar to those of PER-1, but the kcat/Km ratios for 
CTX, CAZ and AZT were increased about 2- to 4-fold 
(Table II). 

Mutants N179D and A164R+N179D. Position 179 is well 
conserved in class A β-lactamases, where an aspartate residue 
is generally found (Table III). The mutation Asn179 → Asp 
in PER-1, either in the N179D mutant or in the double mutant 
A164R+N179D, resulted in a complete loss of activity and 
the corresponding enzymes could not be purified. 

Mutant H170N. A histidine residue is found at position 170 
in PER-1, instead of the highly conserved Asn170 found in 
most of the class A β-lactamases described so far (Ambler 
et al., 1991) (Table III). For penicillin G, cephalothin, CAZ 
and AZT, the H170N mutant displayed kcat and Km values 
similar to those of PER-1 (Table II). By contrast, a marked 
increase in kcat was observed for CTX (5.5-fold) with a 
concomitant decrease in the apparent affinity (~3-fold), thus 
resulting in a 2-fold increase in kcat/Km. 
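
The fold changes quoted for H170N can be re-derived from the cefotaxime values in Table II. The short sketch below does this arithmetic; the only conversion involved is from µM to mM for the kcat/Km ratio, and the variable names are ours.

```python
def catalytic_efficiency(kcat_per_s, km_um):
    """kcat/Km in s^-1 mM^-1, from kcat in s^-1 and Km in uM."""
    return kcat_per_s / km_um * 1000.0


# Cefotaxime parameters from Table II.
WILD_TYPE_CTX = {"km": 441.0, "kcat": 41.0}
H170N_CTX = {"km": 1286.0, "kcat": 225.0}

wt_eff = catalytic_efficiency(WILD_TYPE_CTX["kcat"], WILD_TYPE_CTX["km"])
mut_eff = catalytic_efficiency(H170N_CTX["kcat"], H170N_CTX["km"])

fold_kcat = H170N_CTX["kcat"] / WILD_TYPE_CTX["kcat"]
fold_km = H170N_CTX["km"] / WILD_TYPE_CTX["km"]
fold_eff = mut_eff / wt_eff

print(f"kcat/Km: wild type {wt_eff:.0f}, H170N {mut_eff:.0f} s^-1 mM^-1")
print(f"fold change: kcat {fold_kcat:.1f}, Km {fold_km:.1f}, kcat/Km {fold_eff:.1f}")
# kcat/Km: wild type 93, H170N 175 s^-1 mM^-1
# fold change: kcat 5.5, Km 2.9, kcat/Km 1.9
```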

Mutants of residues located in the α/β domain. Three 
positions were studied in the α/β domain: position 220, position 
237 on strand β3 and position 242 on the loop connecting β3 
and β4 (Bouthors et al., 1998). 

Mutant R220L. An arginine is found at position 220 in 
PER-1 (Table III). This residue might be the equivalent of 
Arg244 in the TEM enzymes, as previously suggested (Matagne 
and Frere, 1995). Replacement of Arg220 by a leucine yielded 
a mutant (R220L) displaying no significant modifications of 
the kinetic parameters for penicillin G and cephalothin. For 
the other drugs (CTX, CAZ and AZT), a general increase in 
apparent affinity was observed. In addition, a significant 
decrease in kcat for CTX was noticed (3.2-fold) (Table II). 

Mutant T237A. As in the ESBLs TEM-5 and TEM-24 
(Sougakoff et al., 1989; Chanal et al., 1992), position 237, 
which contributes to the oxyanion pocket and corresponds to 
an alanine in TEM-1, is occupied by a threonine in PER-1 
(Table III). The replacement of Thr237 by Ala yielded an 
enzyme which exhibited a higher apparent affinity for most of 
the substrates tested, particularly for CTX and AZT (Km values 
lowered by 40- and 10-fold, respectively). By contrast, specific 
and divergent variations of kcat were observed for CAZ (6-fold 
increase) and CTX (4-fold decrease), but, overall, the kcat/Km 
ratio for all the substrates tested was markedly increased. 

Mutant K242E. Lysine 242, which would be located in 
PER-1 on a large loop connecting strands β3 and β4 (Bouthors 
et al., 1998), could be the counterpart of Lys240 found in 
Table II. Kinetic parameters(a) for hydrolysis of β-lactam antibiotics by PER-1 and the corresponding mutants 

Enzyme         Substrate       Km (µM)        kcat (s⁻¹)      kcat/Km (s⁻¹·mM⁻¹) 

PER-1          Penicillin G    27 ± 3         8 ± 0.2         296 ± 22 
               Cephalothin     23 ± 1         8 ± 0.1         348 ± 17 
               Cefotaxime      441 ± 42       41 ± 2          93 ± 8 
               Ceftazidime     4150 ± 611     109 ± 15        26 ± 8 
               Aztreonam       147 ± 15       11 ± 0.7        75 ± 15 

A164R          All substrates  ND(b)          ND              ND 

A171E          Penicillin G    49 ± 2         9 ± 0.2         184 ± 13 
               Cephalothin     47 ± 2         14 ± 0.2        298 ± 20 
               Cefotaxime      284 ± 6        39 ± 0.4        137 ± 4 
               Ceftazidime     4309 ± 95      134 ± 3         31 ± 1 
               Aztreonam       42 ± 1         5 ± 0.1         119 ± 5 

N179D          All substrates  ND             ND              ND 

A164R+A171E    Penicillin G    23 ± 0.01      9 ± 0.01        391 ± 0.2 
               Cephalothin     27 ± 1         12 ± 0.2        444 ± 25 
               Cefotaxime      300 ± 19       61 ± 2          203 ± 20 
               Ceftazidime     2087 ± 207     123 ± 10        59 ± 11 
               Aztreonam       45 ± 4         15 ± 0.5        333 ± 39 

A164R+N179D    All substrates  ND             ND              ND 

H170N          Penicillin G    15 ± 0.7       4 ± 0.04        266 ± 15 
               Cephalothin     10 ± 0.4       4 ± 0.05        400 ± 9 
               Cefotaxime      1286 ± 85      225 ± 11        175 ± 8 
               Ceftazidime     4051 ± 616     150 ± 19        37 ± 10 
               Aztreonam       78 ± 9         14 ± 1          179 ± 19 

R220L          Penicillin G    15 ± 0.4       10 ± 0.06       666 ± 47 
               Cephalothin     10 ± 0.3       4 ± 0.03        400 ± 14 
               Cefotaxime      167 ± 8        13 ± 0.3        78 ± 5 
               Ceftazidime     1329 ± 33      83 ± 1          62 ± 3 
               Aztreonam       39 ± 3         6 ± 0.1         154 ± 14 

T237A          Penicillin G    22 ± 0.4       11 ± 0.05       500 ± 11 
               Cephalothin     7 ± 0.3        15 ± 0.1        2143 ± 117 
               Cefotaxime      10 ± 0.7       10 ± 0.2        1000 ± 94 
               Ceftazidime     2181 ± 232     678 ± 56        311 ± 59 
               Aztreonam       14 ± 1         18 ± 0.2        1286 ± 110 

K242E          Penicillin G    34 ± 0.5       8 ± 0.03        235 ± 4 
               Cephalothin     36 ± 0.6       12 ± 0.06       333 ± 7 
               Cefotaxime      873 ± 0.7      94 ± 0.05       108 ± 0.1 
               Ceftazidime     3672 ± 31      147 ± 1         40 ± 0.6 
               Aztreonam       104 ± 5        16 ± 0.4        154 ± 11 

(a) Units for Km, kcat and kcat/Km are µM, s⁻¹ and s⁻¹·mM⁻¹, respectively. Values are given ± standard error. 
(b) ND, not detectable (kcat < 0.05 s⁻¹). 



Table III. Multiple sequence alignment of β-lactamases PER-1, TEM-1, 
TEM-5, SHV-8, Streptomyces albus G and Staphylococcus aureus PC1 

β-Lactamase      Amino acid at position(a)                      Reference 
                 164   170   171   179   220   237   240 

PER-1            A     H     A     N(b)  R     T     K(c)   Nordmann and Naas (1994) 
TEM-1            R     N     E     D     L     A     E      Sutcliffe (1978) 
TEM-5            S     N     E     D     L     T     K      Sougakoff et al. (1989) 
SHV-8            R     N     E     N     L     A     E      Rasheed et al. (1997) 
S.albus G        R     N     S     D     R     Q     R      Dehottay et al. (1987) 
S.aureus PC1     R     N     Y     D     L     A     I      East and Dyke (1989) 

(a) Numbering according to Ambler et al. (1991). 
(b) Boldface letters indicate the amino acids shared by PER-1 and other enzymes. 
(c) Residue number 242 in PER-1. 



various ESBLs displaying a high activity against CAZ and 
AZT (Bush and Jacoby, 1997) (Table III). This residue was 
replaced by a glutamic acid, which is the residue found at 
position 240 in TEM-1 (Table III). As shown in Table II, the 
steady-state kinetic parameters determined from the K242E 
mutant were nearly identical with those measured from PER-1. 

Discussion 

We have described the catalytic behaviour of various PER-1 
mutants in which residues 164, 170, 171, 179, 220, 237 and 
242 were modified. 

In PER-1, an alanine residue is found at position 164 instead 
of the highly conserved arginine identified in the other class 
A β-lactamases (Ambler et al., 1991). As reported above and 
as observed previously (Bouthors et al., 1998), replacement 
of Ala164 by Arg in PER-1 resulted in a mutant protein which 
could not be detected on SDS-PAGE analysis and which 
displayed no detectable β-lactamase activity. In order to explain 
such a result, theoretical three-dimensional models of the class 
A β-lactamase PER-1 and the corresponding mutant A164R 
were constructed and compared with each other (Figure 1A). 
Despite the relatively low degree of identity found at the 
amino acid level between PER-1 and the other class A 
β-lactamases, homology modelling was used to generate the 
model structures of PER-1 and the A164R enzyme because it 
is now well established that class A β-lactamases form a superfamily 
of enzymes that are all characterized by a very similar 
structural organization, particularly at the level of the active 
site (Joris et al., 1991). Molecular dynamics simulations were 
then performed from the models in order to assess the extent 
of the conformational modifications that could occur in the 
Ω-loop region of the mutant by comparison with that of the 
wild-type enzyme. Based on the results obtained, the Ω-loop region 
in PER-1 appears to be characterized by fairly high flexibility 
(data not shown). Such a result could be related to the fact 
that the PER-1 Ω-loop is not stabilized by several ionic-bonding 
interactions, thus contrasting with the four salt bridges 
found in TEM-1 between the Ω-loop residues Arg161 and 
Asp163, Arg164 and Glu171, Arg164 and Asp179, and Asp176 
and Arg178 (Jelsch et al., 1993). Therefore, it is likely that 
the Ala164 → Arg substitution induces in PER-1 significant 
conformational modifications at the level of the Ω-loop. 
Accordingly, the topology of the main-chain atoms between 
residues 171 and 179 is significantly different in the A164R 
mutant, when compared with PER-1 (r.m.s.d. = 0.6 Å) 
(Figure 1A). In the wild-type enzyme model, the Ω-loop 
conformation is generally wider than in the mutant structure, 
the side chain of Asp172 being oriented towards the outside of the loop. 
By contrast, in the A164R enzyme, the bulky side chain of 
Arg164 would point towards the inside of the Ω-loop and, due to a putative 
salt bridge interaction, the Asp172 side chain would 
be reoriented towards that of Arg164 (Figure 1A). Such a salt 
bridge cannot be established without a significant conformational 
modification of the 172 region (Figure 1A), which 
could account for the instability and the loss of activity of the 
mutant enzyme. 
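
The r.m.s.d. values quoted for the main-chain atoms measure the root mean square distance between equivalent atoms in two structures. A minimal sketch of this calculation follows; it assumes the two coordinate sets have already been superimposed and list the same atoms in the same order, and the coordinates are hypothetical values chosen so that every atom is displaced by 0.6 Å.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean square deviation between two equally ordered sets of 3D points.

    Assumes the structures are already superimposed; no fitting is performed.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate sets must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Hypothetical coordinates (in angstroms), for illustration only.
wild_type_atoms = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.5, 0.0)]
mutant_atoms = [(0.6, 0.0, 0.0), (2.1, 0.0, 0.0), (3.6, 0.5, 0.0)]

print(f"r.m.s.d. = {rmsd(wild_type_atoms, mutant_atoms):.1f} A")  # r.m.s.d. = 0.6 A
```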

Contrasting with the behaviour of the A164R mutant, the 
substitution in PER-1 of Ala171 by a glutamic acid yielded 
an enzyme characterized by kinetic parameters very similar to 
Fig. 1. Ribbon models of PER-1 (blue) and the two mutants A164R (red) and A164R+A171E (yellow). (A) Superposition of PER-1 and A164R; 
(B) superposition of PER-1 and A164R+A171E; (C) model of strands β3 and β4 in wild-type PER-1. In (A) and (B), the models represent the mean of the 
conformations obtained by dynamic simulations of the 160-180 region encompassing the Ω-loop (residues 161-179). Model (C) was obtained by homology 
modelling and energy minimization, as described in the Materials and methods section. Thin colour sticks: side chains of residues 70, 164, 166, 170, 171, 172, 
220 and 237. Green dotted lines: hydrogen bonds. The figures were generated by using the program InsightII from Molecular Simulations. 



those obtained from the wild-type enzyme. Moreover, it is 
noteworthy that introduction of the Ala171 → Glu mutation 
in the inactive mutant A164R yielded a double mutant, 
A164R+A171E, which was fully active and exhibited kcat and 
Km values nearly identical with those of PER-1 and A171E 
(Table II). Strikingly, in the minimized mean of the 
A164R+A171E structure model obtained by molecular 
dynamics simulations, the side chain of Glu171 could be 
adequately oriented to be hydrogen bonded and/or to form a 
strong ionic bond with the side chain of Arg164, while that 
of Asp172 would consequently be oriented towards the outside of the 
Ω-loop, i.e. in a position nearly identical with that found in 
the wild-type PER-1 enzyme (Figure 1B), which could explain why the 
double mutant remained fully active despite the presence 
of Arg164. 

Two other amino acids (Asn179 and His170) were investigated 
in the Ω-loop of PER-1. The asparagine residue, found 
at position 179 in PER-1 as in the ESBL SHV-8 (Rasheed 
et al., 1997) (Table III), was initially thought to play a specific 
role in the activity of PER-1 against expanded-spectrum 
cephalosporins. Unexpectedly, replacement of Asn179 by an 
aspartate, which is a residue conserved in a large number of 
class A β-lactamases, was highly deleterious for the overall 
β-lactamase activity of the two PER mutants N179D and 
A164R+N179D (Table II). It must be pointed out that the 
interaction between residues 164 and 179, which links the two 
ends of the Ω-loop region in class A β-lactamases, is important 
for a suitable positioning of the key catalytic residue Glu166 
(Knox, 1995; Matagne et al., 1998). Therefore, it is tempting 
to speculate that the presence of an aspartate residue at position 
179 in the inactive mutants N179D and A164R+N179D could 
alter significantly the position of Glu166 and, thereby, the 
β-lactamase activity. 

The histidine found at position 170 in PER-1 corresponds 
to a highly conserved asparagine residue in the other class A 
β-lactamases (Ambler et al., 1991) (Table III). Unexpectedly, 
the kinetic parameters exhibited by the H170N mutant were 
similar to those obtained from PER-1, except for a 5.5-fold 
increase in the kcat value for CTX with a concomitant decrease 
in the apparent affinity for this antibiotic. Palzkill et al. (1994) 
have reported that the replacement of the highly conserved 
Asn170 by a histidine in TEM-1 yielded an active enzyme 
showing unmodified catalytic constants. Taken altogether, these 
data suggest that His170 is not a key residue for the substrate 
profile of PER-1 and one can hypothesize that this residue 
was present in the ancestor of the PER-1 β-lactamase and has 
been conserved during the evolution process leading to PER-1. 

Three positions in PER-1 were investigated in the region of 
the α/β domain forming one of the two edges of the active 
site. Residue 237, located on the β3 strand, belongs to the 
so-called oxyanion pocket and is involved in the binding of 
β-lactams (Ghuysen, 1994; Matagne et al., 1998). In PER-1, 
a threonine is found at position 237 (Figure 1C), which is 
located between the KTG triad and Ser238. Strikingly, it 
has been previously reported that various TEM-type ESBLs 
harbour an A237T substitution (Bush and Jacoby, 1997). Moreover, 
another hydroxylated residue (a serine) is found naturally 
at position 237 in the class A β-lactamase from Proteus 
vulgaris which displays a high catalytic activity against CTX 
and it has been shown that the substitution Ser237 → Ala in 
this enzyme leads to a decrease in the catalytic efficiency 
against this drug (Tamaki et al., 1994). Therefore, the decrease 
in kcat observed for CTX with the T237A mutant of PER-1 
confirms that Thr237 is important for the catalytic activity of 
PER-1 towards this drug. However, the general increase in 
kcat/Km observed for the T237A mutant of PER-1 against CTX, 
CAZ and AZT, which is due to a general increase in apparent 
affinity towards cephalosporins, was rather unexpected 
(Table II). Nonetheless, these results were confirmed by 
modifying the arginine found at position 220 in PER-1. Indeed, 






A.-T.Bouthors et al. 



according to the hypothetical model of PER-1 shown in Figure 
1C, the side chain of Arg220 would point towards the active 
site cavity and could be hydrogen-bonded to that of Thr237. 
As a consequence of this structural organization, it is likely 
that both residues contribute to adjusting the topology of the 
oxyanion pocket, as previously suggested for other class A 
β-lactamases (Matagne and Frere, 1995). In accordance with 
such a model, the replacement of Arg220 by Leu in PER-1, 
which leads to the loss of the hydrogen-bonding interactions 
between residues 220 and 237, yielded a mutant enzyme 
(R220L) showing kinetic properties similar to those exhibited 
by the T237A mutant, i.e. a significant decrease in the catalytic 
activity against CTX associated with a better apparent affinity 
for expanded-spectrum cephalosporins and AZT (see Table II). 

Finally, we also studied the lysine residue found at position 
242 at the end of the β3 strand in PER-1 (Figure 1C), which 
might be the counterpart of Lys240 found in various TEM- 
type ESBLs (Bush and Jacoby, 1997). The replacement of 
Lys242 in PER-1 by a glutamic acid residue, which is the 
residue found at position 240 in TEM-1 (Table III), yielded a 
mutant enzyme with kinetic properties very similar to those 
of PER-1. This result indicates that Lys242 does not play in 
PER-1 a role equivalent to that of the lysine found at position 
240 in the TEM-type ESBLs. 

In conclusion, PER-1 is a class A ESBL which illustrates 
well the fact that enzymes showing a high level of divergence 
in their amino acid sequences can share very similar substrate 
profiles. Furthermore, our results indicate that, in contrast to 
the TEM-type ESBLs, the PER-1 activity towards expanded- 
spectrum cephalosporins does not stem from the presence in 
the active site of a limited number of residues having a specific 
role in the hydrolysis of these drugs. The X-ray structure 
determination of PER-1, which is in progress, will aid further 
understanding of the structure-activity relationships of this 
peculiar class A β-lactamase. 
ABSTRACT Renilla reniformis is an anthozoan coelenterate 
capable of exhibiting bioluminescence. Bioluminescence in Renilla 
results from the oxidation of coelenterate luciferin (coelenterazine) 
by luciferase [Renilla-luciferin:oxygen 2-oxidoreductase 
(decarboxylating), EC 1.13.12.5]. In vivo, the excited state 
luciferin-luciferase complex undergoes the process of nonradiative 
energy transfer to an accessory protein, green fluorescent 
protein, which results in green bioluminescence. In vitro, Renilla 
luciferase emits blue light in the absence of any green fluorescent 
protein. A Renilla cDNA library has been constructed in λgt11 
and screened by plaque hybridization with two oligonucleotide 
probes. We report here the isolation and characterization of a 
luciferase cDNA and its gene product. The recombinant luciferase 
expressed in Escherichia coli is identical to native 
luciferase as determined by SDS/PAGE, immunoblot analysis, 
and bioluminescence emission characteristics. 

Renilla reniformis (class Anthozoa) is a bioluminescent soft 
coral found in shallow coastal waters of North America, which 
displays blue-green bioluminescence upon mechanical stimu- 
lation (1, 2). The components involved in Renilla biolumines- 
cence have been described in detail (3). Renilla luciferase 
[Renilla-luciferin:oxygen 2-oxidoreductase (decarboxylating), 
EC 1.13.12.5] catalyzes the oxidative decarboxylation of coelenterazine 
in the presence of dissolved oxygen to yield oxyluciferin, 
CO2, and blue light (λmax = 480 nm) (4). This reaction 
has a bioluminescence quantum yield of ≈7%. The stoichi- 
ometry of this reaction and the detailed mechanism leading to 
excited-state formation have been described (4, 5). 

The color of in vitro-catalyzed bioluminescence changes
from blue to green upon addition of submicromolar amounts
of an energy-transfer acceptor green fluorescent protein
(GFP), which has been purified from Renilla and characterized
(6). This green fluorescence (λmax = 509 nm) is identical
to the in vivo emission in Renilla. The energy-transfer process
is nonradiative; an increase in both the quantum yield (6)
and calculated lifetimes has been determined for this process 
(7). Luciferase and GFP form a specific 1:1 rapid equilibrium 
complex in solution (8). 

The elucidation of mechanisms involved in nonradiative 
energy transfer processes as well as determination of detailed 
structural information on both luciferase and GFP have been 
hindered by a lack of material. To overcome this, we have 
cloned, sequenced, and expressed in Escherichia coli a 
cDNA encoding Renilla luciferase.* 

MATERIALS AND METHODS 

Amino Acid Sequence Determination of Renilla Luciferase.

Native Renilla luciferase was isolated as described (4). Purified
luciferase was digested with staphylococcal protease V-8 (9).
The resulting peptides were purified by HPLC and subjected to
NH2-terminal Edman sequencing as described (10). Based on these
peptide sequences, two 17-base oligonucleotide probes were
synthesized with an Applied Biosystems DNA synthesizer at the
Molecular Genetics Instrumentation Facility at the University
of Georgia.

The publication costs of this article were defrayed in part by page charge
payment. This article must therefore be hereby marked "advertisement"
in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Construction of a cDNA Library in λgt11. Live R. reniformis
were collected at the University of Georgia Marine
Institute located at Sapelo Island. The animals were frozen
immediately in liquid N2 and stored at -80°C. Frozen tissue
was ground to a fine powder in liquid N2 with mortar and
pestle. Total RNA was isolated from the frozen powder by
the guanidine thiocyanate method (11), and poly(A)+ RNA
was isolated by oligo(dT)-cellulose chromatography (12).
cDNA was synthesized by the method of Gubler and Hoffman
(13). Phosphorylated EcoRI linkers (Collaborative Research)
were ligated to the cDNAs, which were then digested
with EcoRI. Separation of cDNA from free linkers after
EcoRI digestion as well as size selection of cDNAs were
accomplished by electrophoresis in low-melting-temperature
agarose (NuSieve, FMC) (14). cDNAs were ligated into the
EcoRI site of λgt11 (15). The library was amplified in Y1088
cells (16) by a plate method (17).

Isolation and DNA Sequence Determination of a Luciferase 
cDNA. Oligonucleotide probes were 5' end-labeled with T4 
polynucleotide kinase (Bethesda Research Laboratories) and 
[γ-32P]ATP (3000 Ci/mmol; 1 Ci = 37 GBq; ICN) to specific
activities ≥1 × 10^? cpm/μg (18). A total of 6 × 10^5 recombinant
plaque-forming units were screened by plaque hybridization 
(19). Phage DNA was isolated as described (20). A luciferase 
cDNA, isolated from the clone λRLuc-6, was subcloned into
the M13 sequencing vectors mp18 and mp19, and sequencing
templates were prepared (21). The DNA sequence of both 
strands was determined by the dideoxynucleotide chain- 
termination technique by using a Sequenase kit (United States 
Biochemical) and [α-35S]dATP (400 Ci/mmol; Amersham)
(22). The M13 universal primer and a λgt11 sequencing primer
(Amersham) were used to prime the sequencing reactions. 
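
As a worked check on the units quoted in this section, the sketch below converts the stated specific activities from Ci/mmol to TBq/mmol using the conversion given in the text (1 Ci = 37 GBq). The function name and constant are illustrative only and are not part of the original methods.

```python
# Convert specific activities from Ci/mmol to TBq/mmol,
# using the conversion stated in the text: 1 Ci = 37 GBq.
GBQ_PER_CI = 37.0  # from the text


def ci_to_tbq(ci_per_mmol: float) -> float:
    """Return the specific activity in TBq/mmol for a value in Ci/mmol."""
    return ci_per_mmol * GBQ_PER_CI / 1000.0


# Values quoted in the methods (names are illustrative).
end_label_atp = ci_to_tbq(3000)  # labeled ATP used for 5' end-labeling
sequencing_datp = ci_to_tbq(400)  # labeled dATP used for sequencing

print(f"3000 Ci/mmol = {end_label_atp:.1f} TBq/mmol")
print(f" 400 Ci/mmol = {sequencing_datp:.1f} TBq/mmol")
```

The same helper applies to the 4000 Ci/mmol nucleotides used for random hexamer priming.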

Expression of Recombinant Luciferase (r-luciferase). Positive
clones were converted to lysogens in E. coli Y1089 cells
(16). Lysogens were grown at permissive temperatures and
induced with 1 mM isopropyl β-D-thiogalactopyranoside
(IPTG). Crude cell extracts were prepared and assayed for
luciferase activity as described below. The plasmid pTZRLuc-1
was constructed by ligation of a 2.2-kilobase-pair (kbp)
EcoRI/Sst I λRLuc-6 restriction fragment into the plasmid
pTZ18R (Pharmacia), which contains the lacZ' gene. E. coli
TG-1 cells (23) were transformed with pTZRLuc-1 (24).
Single colonies were isolated and grown at 37°C in LB
medium containing ampicillin (100 μg/ml) to an OD600 of
0.6-0.8 unit and induced with 1 mM IPTG for 4 hr. The cells
were centrifuged at 10,000 × g and frozen solid at -20°C. The
pellets were thawed and resuspended 1:5 in 10 mM EDTA,
pH 8, and lysozyme at 4 mg/ml (Sigma). After 20-min
incubation at 25°C, the cells were placed on ice for 1 hr and
then sonicated for 30 sec with a Branson cell disrupter. The
cell lysate was clarified by centrifugation at 30,000 × g. The
clarified lysate was used in subsequent bioluminescence
assays and emission studies.

Abbreviations: GFP, green fluorescent protein; IPTG, isopropyl
β-D-thiogalactopyranoside; r-luciferase, recombinant luciferase;
ORF, open reading frame.
*To whom reprint requests should be addressed.
*The sequence reported in this paper has been deposited in the
GenBank data base (accession no. M63501).

Assay for Renilla Luciferase Activity and Determination of 
Emission Spectra. Bioluminescence assays (4) were done with 
a Turner model TD-20e luminometer, and peak light inten- 
sities were determined. Bioluminescence intensity was con- 
verted to quanta per second by calibrating the instrument 
relative to a radioactive 63Ni light standard that emits in the
460- to 480-nm region (25). Corrected emission spectra were
collected on an on-line computerized fluorimeter (26). A
100-μl sample of a clarified pTZRLuc-1 cell extract was
added to 1 ml of luciferase assay buffer (4) or to 1 ml of
"energy-transfer buffer" containing 1 × 10^-6 M GFP (8). An
excess of coelenterazine (0.47 mM) dissolved in MeOH was 
added to maintain a strong emission signal. 

Genomic Southern Blot Analyses. A 790-bp EcoRI/BamHI
cDNA restriction fragment was labeled to specific activities
>1 × 10^9 cpm/μg with both [α-32P]dATP and dCTP (4000
Ci/mmol, ICN) by the random hexamer-priming method (27).
Genomic DNA was isolated from Renilla by a guanidine 
thiocyanate method developed for coelenterate DNA isola- 
tion (D. Prasher, personal communication). DNA samples 
were digested with the appropriate enzymes and resolved in 
a 0.8% agarose gel, followed by transfer to nitrocellulose 
filters (Schleicher & Schuell) (28). Aqueous hybridizations 
and washes were done at high stringencies as described for a 
homologous probe (17). 

Electrophoretic Analysis of Protein. Protein samples were 
analyzed on 12.5% SDS/PAGE gels that were fixed and 
stained with Coomassie blue as described (29). Immunoblots 
were done as described (30). Proteins were transferred to 
nitrocellulose (Schleicher & Schuell) and incubated in a 1:50 
dilution of rabbit anti-native luciferase antibody. Detection of 
the secondary antibody (horseradish peroxidase-conjugated 
goat anti-rabbit IgG) signal was determined according to the 
vendor's instructions (Bio-Rad). 

Computer-Facilitated DNA and Amino Acid Sequence Anal- 
yses. The DNA sequence was compiled and manipulated 
using MicroGenie sequence software (Beckman). 

RESULTS 

Synthesis of Luciferase Oligonucleotides. Seven luciferase 
peptides (V8-1-V8-7) were purified by HPLC, and their 
amino acid sequences were determined. Two of the peptides 
contained regions of relatively low codon degeneracy. Amino 
acid sequence from these regions was used to synthesize two 
mixed-sequence 17-base oligonucleotide hybridization
probes with the following sequences: RLP-1
(GAR-AAY-AAY-TTY-TTY-GT) and RLP-2
(AAR-AAR-TTY-CCN-AAY-AT), which are 32- and 64-fold
degenerate, respectively.
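
The degeneracy figures follow directly from the IUPAC ambiguity codes in each probe: the number of distinct sequences in a mixed oligonucleotide is the product of the number of bases each position can take. A minimal sketch of that arithmetic follows; the helper name `degeneracy` and the lookup table are illustrative and not taken from the original work.

```python
from math import prod

# Number of bases represented by each IUPAC nucleotide code.
IUPAC_CHOICES = {
    "A": 1, "C": 1, "G": 1, "T": 1,
    "R": 2, "Y": 2, "S": 2, "W": 2, "K": 2, "M": 2,
    "B": 3, "D": 3, "H": 3, "V": 3,
    "N": 4,
}


def degeneracy(oligo: str) -> int:
    """Return how many distinct sequences a mixed oligonucleotide contains."""
    bases = oligo.replace("-", "").upper()
    return prod(IUPAC_CHOICES[base] for base in bases)


RLP1 = "GAR-AAY-AAY-TTY-TTY-GT"
RLP2 = "AAR-AAR-TTY-CCN-AAY-AT"

for name, probe in (("RLP-1", RLP1), ("RLP-2", RLP2)):
    length = len(probe.replace("-", ""))
    print(f"{name}: {length} bases, {degeneracy(probe)}-fold degenerate")
```

RLP-2 is twice as degenerate as RLP-1 because its proline codon (CCN) contributes four possibilities rather than two.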



Nucleotide and Deduced Amino Acid Sequence Analyses of
Renilla Luciferase. Six clones were isolated from the Renilla
cDNA library. One clone, λRLuc-6, hybridized to both
oligonucleotide probes. The cDNA insert could not be isolated
after EcoRI digestion, as one of the linker sites was lost
during cloning. A double digest with EcoRI and Sst I produced
a 2.2-kbp fragment that contained a 1.2-kbp cDNA
with 1 kbp of λgt11 DNA at the 3' end. This fragment was
subcloned into the M13 vectors mp18 and mp19. DNA
sequencing provided the locations of six-base restriction sites
(Fig. 1), which were used to generate specific sequencing
subclones. The entire 1.2-kbp luciferase cDNA was sequenced
on both strands.

The cDNA, excluding the EcoRI linkers, is 1196 nucleotides
long and encodes an open reading frame (ORF) of 314
amino acids (Fig. 2). Although an ATG in-frame codon is 
found at the 5' end of the cDNA, the intrinsic mRNA may 
contain additional 5' coding nucleotides. If the first ATG 
codon in the ORF is designated as the initiation codon, the 
predicted 311 amino acid sequence is essentially identical in 
size (34 kDa) and composition to native Renilla luciferase (4). 
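
The relationship between the 314-codon ORF, the first in-frame ATG, and the predicted 311-residue, 34-kDa protein can be checked with a few lines of arithmetic. The sketch below uses the first 12 codons printed in Fig. 2; the average residue mass of 110 Da is a common rule of thumb assumed here for illustration, not a value taken from the paper.

```python
# First 12 codons of the cDNA as printed in Fig. 2 (nucleotides 1-36).
CDNA_5PRIME = "AGCTTAAAGATGACTTCGAAAGTTTATGATCCAGAA"
ORF_CODONS = 314           # ORF length reported in the text
AVG_RESIDUE_MASS_DA = 110  # rule-of-thumb average (assumption)

# Split the sequence into triplets in the reading frame of the ORF.
codons = [CDNA_5PRIME[i:i + 3] for i in range(0, len(CDNA_5PRIME), 3)]

# 1-based triplet position of the first in-frame ATG.
first_atg = codons.index("ATG") + 1

# Residues remaining if translation starts at that ATG.
protein_length = ORF_CODONS - (first_atg - 1)

# Rough mass estimate in kDa.
approx_mass_kda = protein_length * AVG_RESIDUE_MASS_DA / 1000

print(f"First ATG at triplet {first_atg}")
print(f"Predicted length: {protein_length} residues")
print(f"Approximate mass: {approx_mass_kda:.1f} kDa")
```

The estimate agrees with the 34-kDa size reported for native luciferase, although an exact mass would require the full amino acid composition.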

Comparison of the deduced amino acid sequence with the 
native peptides reveals that λRLuc-6 encodes a luciferase
cDNA (Fig. 2). One discrepancy lies at amino acid residue 
222, which is leucine in the peptide sequence but tryptophan 
in the deduced sequence. Sequencing autoradiograms from 
this region of the clone have been examined carefully and 
found free of any irregularities. The protein sequence also 
contains a consensus N-linked glycosylation site (Asn-Xaa- 
Ser) beginning at residue 92. 
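
A consensus site of this kind can be located by a simple pattern search. The sketch below scans residues 81-100 of the deduced sequence (translated from nucleotides 241-300 of Fig. 2) for the sequon. It uses the commonly cited, slightly broader definition Asn-Xaa-Ser/Thr with Xaa not proline, which is an assumption beyond the Asn-Xaa-Ser form named in the text; the function name is illustrative.

```python
import re

# Residues 81-100 of the deduced sequence (one-letter code),
# translated from nucleotides 241-300 of Fig. 2.
WINDOW_START = 81
WINDOW = "LIGMGKSGKSGNGSYRLLDH"

# Asn-Xaa-Ser/Thr with Xaa != Pro (assumed, broader than Asn-Xaa-Ser).
# A lookahead is used so that overlapping sequons would all be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def sequon_positions(peptide: str, first_residue: int = 1) -> list[int]:
    """Return residue numbers of the Asn that begins each sequon."""
    return [m.start() + first_residue for m in SEQUON.finditer(peptide)]


print(sequon_positions(WINDOW, WINDOW_START))
```

Running the same search over the complete 314-residue sequence would show whether any further candidate sites exist; only the window above is checked here.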

Genomic Southern Analysis. A Renilla genomic Southern
blot was probed with a 790-base-pair (bp) EcoRI/BamHI
luciferase cDNA restriction fragment (Fig. 3). The BamHI
digest, lane A, contains two hybridizing bands as does the
Sma I digest, lane B. The Bgl II digest, lane C, contains four
bands. If luciferase is encoded by a single gene containing no
introns, a single band would be expected in the BamHI and
Sma I digests, as these two sites are not spanned by the
hybridization probe. Similarly, two bands would be expected
in the Bgl II digest. That the BamHI and Sma I digests contain
two hybridizing bands shows either that there is more than
one luciferase gene or that the luciferase gene(s) has introns
containing BamHI and Sma I sites. The four bands seen in the
Bgl II digest could be explained by two very large introns
containing Bgl II sites. When genomic DNA was digested with
restriction enzymes having no sites within the cDNA sequence,
there were always two or more hybridizing bands
(data not shown). These results suggest that luciferase is
encoded by more than one gene, which may or may not
contain introns.
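
The band-count reasoning can be made concrete with a small model: for a single-copy, intron-free gene, the number of hybridizing bands equals the number of restriction fragments that overlap the probe region. In the sketch below every coordinate is hypothetical and chosen only to illustrate the logic; none comes from the paper.

```python
def hybridizing_bands(cut_sites, probe_start, probe_end,
                      locus_start, locus_end):
    """Count restriction fragments that overlap the probe region."""
    inner = sorted(s for s in cut_sites if locus_start < s < locus_end)
    bounds = [locus_start] + inner + [locus_end]
    fragments = zip(bounds, bounds[1:])
    return sum(
        1 for left, right in fragments
        if left < probe_end and right > probe_start
    )


# Hypothetical 10-kbp locus with a 790-bp probe region.
LOCUS = (0, 10_000)
PROBE = (4_000, 4_790)

# Enzyme with no site inside the probe region: one band expected.
no_internal_site = hybridizing_bands([1_000, 8_000], *PROBE, *LOCUS)

# Enzyme with one site inside the probe region: two bands expected.
one_internal_site = hybridizing_bands([1_000, 4_400, 8_000], *PROBE, *LOCUS)

print(no_internal_site, one_internal_site)
```

Observing more bands than this model predicts, as in Fig. 3, is what points to additional gene copies or to introns carrying extra sites.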

Luciferase Expression in E. coli. The λRLuc-6 lysogen is
capable of low-level r-luciferase expression as determined by
light emission from clarified, crude extracts (5 × 10^10
hν·sec^-1·ml^-1). When these cells are induced with 1 mM
IPTG, light emission decreases by 2-fold; this happens because
the cDNA is reversely oriented with respect to the
λgt11 lacZ promoter. Presumably, when IPTG is absent, the
luciferase gene is transcribed from a promoter in the right end
of λgt11, as reported (31).

The 2.2-kbp EcoRI/Sst I fragment was subcloned into the
plasmid pTZ18R, which uses the lacZ promoter. The ORF of 



[Fig. 1 restriction map labels: EcoRI, Asu II, BamHI; scale bar, 100 bp]



Fig. 1. Location of six-base restriction enzyme sites within the luciferase cDNA. The boxed region defines the ORF. The EcoRI site at the
5' end is a synthetic linker site.



4440 Biochemistry: Lorenz et al. Proc. Natl. Acad. Sci. USA 88 (1991)



   1 AGC TTA AAG ATG ACT TCG AAA GTT TAT GAT CCA GAA CAA AGG AAA CGG ATG ATA ACT GGT 60
   1 Ser Leu Lys Met Thr Ser Lys Val Tyr Asp Pro Glu Gln Arg Lys Arg Met Ile Thr Gly 20

  61 CCG CAG TGG TGG GCC AGA TGT AAA CAA ATG AAT GTT CTT GAT TCA TTT ATT AAT TAT TAT 120
  21 Pro Gln Trp Trp Ala Arg Cys Lys Gln Met Asn Val Leu Asp Ser Phe Ile Asn Tyr Tyr 40

 121 GAT TCA GAA AAA CAT GCA GAA AAT GCT GTT ATT TTT TTA CAT GGT AAC GCG GCC TCT TCT 180
  41 Asp Ser Glu Lys His Ala Glu Asn Ala Val Ile Phe Leu His Gly Asn Ala Ala Ser Ser 60

 181 TAT TTA TGG CGA CAT GTT GTG CCA CAT ATT GAG CCA GTA GCG CGG TGT ATT ATA CCA GAT 240
  61 Tyr Leu Trp Arg His Val Val Pro His Ile Glu Pro Val Ala Arg Cys Ile Ile Pro Asp 80

 241 CTT ATT GGT ATG GGC AAA TCA GGC AAA TCT GGT AAT GGT TCT TAT AGG TTA CTT GAT CAT 300
  81 Leu Ile Gly Met Gly Lys Ser Gly Lys Ser Gly Asn Gly Ser Tyr Arg Leu Leu Asp His 100

 301 TAC AAA TAT CTT ACT GCA TGG TTT GAA CTT CTT AAT TTA CCA AAG AAG ATC ATT TTT GTC 360
 101 Tyr Lys Tyr Leu Thr Ala Trp Phe Glu Leu Leu Asn Leu Pro Lys Lys Ile Ile Phe Val 120

 361 GGC CAT GAT TGG GGT GCT TGT TTG GCA TTT CAT TAT AGC TAT GAG CAT CAA GAT AAG ATC 420
 121 Gly His Asp Trp Gly Ala Cys Leu Ala Phe His Tyr Ser Tyr Glu His Gln Asp Lys Ile 140

 421 AAA GCA ATA GTT CAC GCT GAA AGT GTA GTA GAT GTG ATT GAA TCA TGG GAT GAA TGG CCT 480
 141 Lys Ala Ile Val His Ala Glu Ser Val Val Asp Val Ile Glu Ser Trp Asp Glu Trp Pro 160

 481 GAT ATT GAA GAA GAT ATT GCG TTG ATC AAA TCT GAA GAA GGA GAA AAA ATG GTT TTG GAG 540
 161 Asp Ile Glu Glu Asp Ile Ala Leu Ile Lys Ser Glu Glu Gly Glu Lys Met Val Leu Glu 180

 541 AAT AAC TTC TTC GTG GAA ACC ATG TTG CCA TCA AAA ATC ATG AGA AAG TTA GAA CCA GAA 600
 181 Asn Asn Phe Phe Val Glu Thr Met Leu Pro Ser Lys Ile Met Arg Lys Leu Glu Pro Glu 200

 601 GAA TTT GCA GCA TAT CTT GAA CCA TTC AAA GAG AAA GGT GAA GTT CGT CGT CCA ACA TTA 660
 201 Glu Phe Ala Ala Tyr Leu Glu Pro Phe Lys Glu Lys Gly Glu Val Arg Arg Pro Thr Leu 220

 661 TCA TGG CCT CGT GAA ATC CCG TTA GTA AAA GGT GGT AAA CCT GAC GTT GTA CAA ATT GTT 720
 221 Ser Trp(+) Pro Arg Glu Ile Pro Leu Val Lys Gly Gly Lys Pro Asp Val Val Gln Ile Val 240

 721 AGG AAT TAT AAT GCT TAT CTA CGT GCA AGT GAT GAT TTA CCA AAA ATG TTT ATT GAA TCG 780
 241 Arg Asn Tyr Asn Ala Tyr Leu Arg Ala Ser Asp Asp Leu Pro Lys Met Phe Ile Glu Ser 260

 781 GAT CCA GGA TTC TTT TCC AAT GCT ATT GTT GAA GGC GCC AAG AAG TTT CCT AAT ACT GAA 840
 261 Asp Pro Gly Phe Phe Ser Asn Ala Ile Val Glu Gly Ala Lys Lys Phe Pro Asn Thr Glu 280

 841 TTT GTC AAA GTA AAA GGT CTT CAT TTT TCG CAA GAA GAT GCA CCT GAT GAA ATG GGA AAA 900
 281 Phe Val Lys Val Lys Gly Leu His Phe Ser Gln Glu Asp Ala Pro Asp Glu Met Gly Lys 300

 901 TAT ATC AAA TCG TTC GTT GAG CGA GTT CTC AAA AAT GAA CAA TAATTACTTT GGTTTTTTAT 962
 301 Tyr Ile Lys Ser Phe Val Glu Arg Val Leu Lys Asn Glu Gln 314

 963 TTACATTTTT CCCGGGTTTA ATAATATAAA TGTCATTTTC AACAATTTTA TTTTAACTGA ATATTTCACA 1032

1033 GGGAACATTC ATATATGTTG ATTAATTTAG CTCGAACTTT ACTCTGTCAT ATCATTTTGG AATATTACCT 1102

1103 CTTTCAATGA AACTTTATAA ACAGTGGTTC AATTAATTAA TATATATTAT AATTACATTT GTTATGTAAT 1172

1173 AAACTCGGTT TTATTATAAA AAAA 1196



Fig. 2. Nucleotide sequence and translated amino acid sequence of the Renilla luciferase cDNA. Putative and known translation control 
elements, as well as oligonucleotide hybridization sites, are underlined. Positions of native luciferase peptide sequences are boxed and, except 
at one residue (+), are identical to the deduced amino acid sequence obtained from the luciferase cDNA. Some of the native luciferase peptide 
sequences overlap at glutamic acid residues. 



the cDNA is not in frame with the lacZ' gene ORF of 
pTZ18R. Supernatants were prepared from IPTG-induced 
pTZRLuc-1 cells, as described, and the level of luciferase 
expression was measured by the standard luciferase assay. A 
high level of r-luciferase activity, 2 × 10^13 hν·sec^-1·ml^-1, is
detected in clarified crude extracts of pTZRLuc-1 cells. This 
level of activity is 7-fold greater than in uninduced pTZR- 
Luc-1 cells. 
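
The expression levels reported for the two constructs can be compared directly. The sketch below takes the light-emission values quoted in this section (5 × 10^10 hν·sec^-1·ml^-1 for the uninduced lysogen extract and 2 × 10^13 hν·sec^-1·ml^-1 for the induced plasmid extract) and the stated 7-fold induction; the uninduced plasmid level is derived from those numbers, not reported in the paper, and the variable names are illustrative.

```python
# Peak light emission of clarified crude extracts, in photons/sec/ml,
# as quoted in the text.
LYSOGEN_UNINDUCED = 5e10   # lambda RLuc-6 lysogen, no IPTG
PLASMID_INDUCED = 2e13     # pTZRLuc-1 cells, IPTG-induced
INDUCTION_FOLD = 7         # induced vs. uninduced pTZRLuc-1

# How much more activity the plasmid construct gives than the lysogen.
plasmid_vs_lysogen = PLASMID_INDUCED / LYSOGEN_UNINDUCED

# Implied activity of uninduced pTZRLuc-1 cells (derived value).
plasmid_uninduced = PLASMID_INDUCED / INDUCTION_FOLD

print(f"Induced plasmid vs. lysogen: {plasmid_vs_lysogen:.0f}-fold")
print(f"Implied uninduced plasmid level: {plasmid_uninduced:.1e}")
```

Even without induction, the implied plasmid level exceeds that of the lysogen, which is consistent with the "leaky" expression noted in the Discussion.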

A prominent protein band (Mr = 34,000) migrating to the
position of native luciferase is seen after SDS/PAGE of pTZR- 



Luc-1 crude extracts (Fig. 4). Crude extracts of IPTG-induced 
pTZRLuc-1 cells were analyzed by immunoblotting (Fig. 5). 
The protein band that reacts with antiluciferase antibody, lane 
B, corresponds to the same band seen in the Coomassie-stained 
gel (Fig. 4). Native luciferase was used as a positive control, 
lane A. No signal is detectable in the crude extract of pTZ18R 
cells, lane C. A duplicate filter incubated with preimmune 
serum showed no detectable signal. 

Bioluminescence Emission Spectra. The r-luciferase-catalyzed
bioluminescence emission spectrum (Fig. 6a) is

Fig. 3. Genomic Southern blot of R.
reniformis DNA hybridized to a 790-bp
EcoRI/BamHI luciferase cDNA fragment.
Genomic DNA had been digested
with the following restriction enzymes:
BamHI (lane A); Sma I (lane B); and Bgl
II (lane C). Samples were resolved in a
0.8% agarose gel and transferred to
nitrocellulose. Each lane contains 20 μg of
digested genomic DNA. Molecular size
markers are in kbp.



very similar to that seen with native luciferase (32). The 
r-luciferase emission spectrum has a λmax = 480 nm and a
slight shoulder at 400 nm, which correspond to emission from 
the excited-state oxyluciferin monoanion and neutral spe- 
cies, respectively. Disproportionation between these species 
is sensitive to environmental factors (7); thus, this spectrum 
indicates the strong similarity of the active-site environment 
between r-luciferase and the native enzyme. Although an 
increase in quantum yield has yet to be determined, r-luci- 
ferase can clearly transfer energy in the presence of Renilla 
GFP (Fig. 6b). The emission spectrum dramatically shifted 







Fig. 4. SDS/PAGE analysis of total protein from IPTG-induced
E. coli cells transformed with either pTZRLuc-1 or pTZ18R.
Ten-milliliter cultures were grown to an OD600 = 0.8 and induced with 1
mM IPTG for 4 hr. One milliliter of cell culture, OD600 = 5.0, was
pelleted and resuspended in 0.5 ml of SDS sample buffer. Samples
were boiled for 5 min, and 20 μl was loaded per lane: native luciferase
(10 μg) (lane A); pTZRLuc-1 cells (lane B); and pTZ18R cells (lane
C). Molecular weight (Mr × 10^-3) standard positions are indicated.
Arrow shows position of native luciferase (L'ase).




Fig. 5. Immunoblot analysis of total
protein. Sample preparation and electrophoresis
were the same as in Fig. 4.
Native luciferase (2 μg) (lane A); 10 μl of
pTZRLuc-1 cell extract (lane B); and 10
μl of pTZ18R cell extract (lane C).
Molecular weight (Mr × 10^-3) standard
positions are indicated.



from the broad emission band generated by r-luciferase to the 
narrow, structured emission band (λmax = 509 nm) seen when
GFP is present. The emission spectrum generated with 
r-luciferase and GFP is very similar to the same spectrum 
generated with native luciferase and GFP (33). 




[Fig. 6 plot axis: Wavelength (nm)]

Fig. 6. Bioluminescence emission spectra generated with crude
r-luciferase and r-luciferase plus GFP. Crude pTZRLuc-1 cell
extracts were prepared, as described, and 100 μl, ≈1 × 10^-6 M
r-luciferase, as determined by peak light emission, was used to
generate each spectrum. (a) Emission spectrum of crude r-luciferase.
(b) Spectrum that results when 1 × 10^-6 M Renilla GFP is added to
crude r-luciferase in energy-transfer buffer.

DISCUSSION

This work describes the isolation of a 1.2-kbp R. reniformis
luciferase cDNA capable of directing the expression of
r-luciferase. The cDNA contains an ORF encoding a
314-amino acid sequence in which all of the native luciferase
peptide sequences obtained from V8-protease digestion are
found. Rescreening the cDNA library with a 790-bp luciferase
cDNA fragment as a hybridization probe has failed to pro- 
duce other clones that contain the 5' noncoding region; 
therefore, whether the luciferase cDNA is full-length is not 
known. This uncertainty can be resolved by sequencing 
genomic clones corresponding to the 5' end of the luciferase 
gene. 

The genomic Southern hybridization data indicate that
Renilla luciferase is probably encoded by more than one 
gene, which may or may not contain intervening sequences. 
Further characterization of luciferase genomic clones will be 
required before the genetic organization of the luciferase 
gene(s) can be defined. 

A putative initiation codon located at triplet position 4 of 
the ORF may be the translation initiation site for Renilla 
luciferase; the 311-amino acid sequence is essentially iden- 
tical to native luciferase with respect to its composition and 
predicted molecular weight. Irrespective of whether this 
cDNA is full length, the luciferase that it encodes is ex- 
pressed in pTZRLuc-1 cells and is catalytically active. The 
expression data demonstrate that r-luciferase is the same size 
as native luciferase on SDS/PAGE gels and is reactive with 
polyclonal rabbit antibodies raised against native Renilla 
luciferase. Expression of r-luciferase from the plasmid pTZR- 
Luc-1 is "leaky" because activity can be detected from 
uninduced cell cultures. The luciferase cDNA ORF is not in 
frame with the short lacZ' ORF contained in this construct.
Any translation product initiating at the β-galactosidase
sequence of pTZRLuc-1 would be terminated at a stop codon
immediately adjacent to the putative initiation codon in the
luciferase cDNA. Thus, the r-luciferase seen in SDS/PAGE
gels does not contain any β-galactosidase sequence. We
propose that expression of r-luciferase by pTZRLuc-1 is due 
to a translation coupling mechanism (34). 

r-luciferase displays two very important characteristics of 
native luciferase: the ability to catalyze coelenterazine oxidation
with the concomitant emission of blue light (λmax = 480
nm) and the ability to transfer energy to Renilla GFP with the
production of green light (λmax = 509 nm). The two emission
bands at 400 nm and 480 nm in the r-luciferase spectrum 
verify the strong similarity between the native and recombi- 
nant proteins and suggest that the integrity of the luciferase 
active site has been maintained. Furthermore, that energy 
transfer occurs in the presence of GFP shows that the 
luciferase domain(s) required for the interaction between 
luciferase and GFP is present in r-luciferase. Once pure 
r-luciferase is available, energy transfer quantum yield mea- 
surements will offer a more quantitative determination of the 
efficiency of the nonradiative energy-transfer process. Fi- 
nally, the data demonstrate that N-linked glycosylation is not 
required for luciferase activity because E. coli does not perform
this modification (35). r-luciferase in E. coli crude extracts 
behaves like the native enzyme by every criterion examined 
thus far.

bioluminescence emission spectral data; Leon Dure, Claiborne
Gray for technical assistance. This is contribution number 659 from
the University of Georgia Marine Institute, Sapelo Island, GA. This 
work was funded, in part, by a grant from ELA Technologies Inc. 

1. Harvey, E. N. (1952) in Bioluminescence, ed. Harvey, E. N. 
(Academic, New York), pp. 148-194.

2. Morin, J. G. (1974) in Coelenterate Biology: Reviews and 
Perspectives, eds. Muscatine, L. & Lenhoff, H. M. (Academic, 
New York), pp. 397-438. 

3. Cormier, M. J. (1978) in Bioluminescence in Action, ed. Her- 
ring, P. J. (Academic, New York), pp. 75-108. 

4. Matthews, J. C., Hori, K. & Cormier, M. J. (1977)
Biochemistry 16, 85-91.

5. Hart, R. C., Stempel, K. E., Boyer, P. D. & Cormier, M. J.
(1978) Biochem. Biophys. Res. Commun. 81, 980-986.

6. Ward, W. W. & Cormier, M. J. (1979) J. Biol. Chem. 254,
781-788.

7. Hart, R. C., Matthews, J. C., Hori, K. & Cormier, M. J. (1979)
Biochemistry 18, 2204-2210.

8. Ward, W. W. & Cormier, M. J. (1978) Photochem. Photobiol.
27, 389-396.

9. Houmard, J. & Drapeau, G. R. (1972) Proc. Natl. Acad. Sci. 
USA 69, 3506-3509. 

10. Charbonneau, H., Walsh, K. A., McCann, R. O., Prendergast, 
F. G., Cormier, M. J. & Vanaman, T. C. (1985) Biochemistry 
24, 6762-6771. 

11. Chirgwin, J. M., Przybyla, A. E., MacDonald, R. J. & Rutter,
W. J. (1979) Biochemistry 18, 5294-5299.

12. Aviv, H. & Leder, P. (1972) Proc. Natl. Acad. Sci. USA 69,
1408-1412.

13. Gubler, U. & Hoffman, B. J. (1983) Gene 25, 263-269.

14. Wieslander, L. (1979) Anal. Biochem. 98, 305-309.

15. Young, R. A. & Davis, R. W. (1983) Proc. Natl. Acad. Sci.
USA 80, 1194-1198.

16. Ausubel, F. M., Brent, R., Kingston, R. E., Moore, D. D.,
Seidman, J. G., Smith, J. A. & Struhl, K. (1987) Current
Protocols in Molecular Biology (Wiley, New York).

17. Maniatis, T., Fritsch, E. F. & Sambrook, J. (1982) Molecular 
Cloning: A Laboratory Manual (Cold Spring Harbor Lab. , Cold 
Spring Harbor, NY). 

18. Chaconas, G. & van de Sande, J. H. (1980) Methods Enzymol. 
68, 75-88. 

19. Wood, W. I., Gitschier, J., Lasky, L. A. & Lawn, R. M. (1985)
Proc. Natl. Acad. Sci. USA 82, 1585-1588.

20. Grossberger, D. (1987) Nucleic Acids Res. 15, 6737.

21. Yanisch-Perron, C., Vieira, J. & Messing, J. (1985) Gene 33,
103-119.

22. Sanger, F., Nicklen, S. & Coulson, A. R. (1977) Proc. Natl. 
Acad. Sci. USA 74, 5463-5467. 

23. Gay, N. J. & Walker, J. E. (1985) Biochem. J. 225, 707-712. 

24. Kushner, S. R. (1978) in Genetic Engineering, eds. Boyer, 
H. W. & Nicosia, S. (Elsevier Biomed., Amsterdam), pp. 
17-23. 

25. O'Kane, D. J. & Lee, J. (1990) Photochem. Photobiol. 52, 
723-734. 

26. Mulkerrin, M. G. & Wampler, J. E. (1982) Anal. Chem. 54, 
1778-1782. 

27. Feinberg, A. P. & Vogelstein, B. (1983) Anal. Biochem. 132, 
6-13. 

28. Southern, E. M. (1975) J. Mol. Biol. 98, 503-517. 

29. Laemmli, U. K. (1970) Nature (London) 227, 680-685.

30. Burnette, W. N. (1981) Anal. Biochem. 112, 195-203.

31. Chirala, S. S. (1986) Nucleic Acids Res. 14, 5935.

32. Matthews, J. C., Hori, K. & Cormier, M. J. (1977)
Biochemistry 16, 5217-5220.

33. Ward, W. W. & Cormier, M. J. (1978) Methods Enzymol. 57,
257-267.

34. Stormo, G. D. (1986) in Maximizing Gene Expression, eds.
Reznikoff, W. & Gold, L. (Butterworths, Boston), pp. 195-224.

35. Kornfeld, R. & Kornfeld, S. (1985) Annu. Rev. Biochem. 54,
631-664.



38 © 1999 Nature America Inc. • http://biotech.nature.com 



RESEARCH 



Directed evolution of the surface 
chemistry of the reporter enzyme 
β-glucuronidase

Ichiro Matsumura 1, John B. Wallingford 2,3, Neeraj K. Surana 1,4, Peter D. Vize 2, and Andrew D. Ellington 1*

1 Institute of Cellular and Molecular Biology, ICMB A4800/MBB 3.424, 26th and Speedway, University of Texas, Austin, TX 78712. 2 Center for Developmental
Biology, Department of Zoology, University of Texas, Austin, TX 78712. 3 Present address: Department of Molecular and Cell Biology, University of California,
Berkeley, Berkeley, CA 94720. 4 Present address: Division of Biology and Biomedical Sciences, Washington University, St. Louis, MO 63108. *Corresponding author
(e-mail: andy.ellington@mail.utexas.edu).

Received 29 December 1999; accepted 4 May 1999

The use of the Escherichia coli enzyme β-glucuronidase (GUS) as a reporter in gene expression studies
is limited due to loss of activity during tissue fixation by glutaraldehyde or formaldehyde. We have
directed the evolution of a GUS variant that is significantly more resistant to both glutaraldehyde and 
formaldehyde than the wild-type enzyme. A variant with eight amino acid changes was isolated after 
three rounds of mutation, DNA shuffling, and screening. Surprisingly, although glutaraldehyde is known 
to modify and cross-link free amines, only one lysine residue was mutated. Instead, amino acid changes 
generally occurred near conserved lysines, implying that the surface chemistry of the enzyme was 
selected to either accept or avoid glutaraldehyde modifications that would normally have inhibited func- 
tion. We have shown that the GUS variant can be used to trace cell lineages in Xenopus embryos under 
standard fixation conditions, allowing double staining when used in conjunction with other reporters.

Since plants express endogenous β-galactosidase activity, lacZ cannot
be employed as a reporter gene 1. Instead, the Escherichia coli
β-glucuronidase gene (gusA, formerly uidA) has been developed as a
reporter gene for plants, and has been widely used for over a 
decade 2 . Both chromogenic and fluorogenic GUS substrates have 
been synthesized 3 , allowing rapid nonradioactive assays. The GUS 
enzyme is stable and active under a variety of conditions 1 , even 
when fused to other sequences 4 . 

The utility of GUS as a reporter, however, has been constrained 
in three ways. First, many animal systems, and some plants and
plant-associated bacteria express endogenous glucuronidase activities 2,3.
Second, GUS activity is greatly reduced during tissue fixation
by glutaraldehyde or formaldehyde, making it necessary to trade off 
retention of activity for preservation of tissue structure 5 . Third, 
both of these considerations drastically restrict the use of GUS as a 
reporter gene in vertebrate systems 6 . 

Enzymatic inactivation by aldehydes is largely due to the for- 
mation of Schiff bases with surface-accessible lysine residues 7 . 
While the removal of lysine residues by directed mutation might 
render an enzyme more resistant to fixatives, many surface lysines 
are critical for function and cannot be readily changed. The 
sequences of the E. coli 8, human 9, mouse 10, rat 11, and dog 12
homologs are known. Six of the 27 lysine residues in the E. coli
protein are conserved in the other species and thus are likely
essential. Moreover, to find what combination of the 27 lysine
residues could be changed in order to increase resistance to fixatives
without abrogating enzyme activity would require constructing
and assaying a dauntingly large number of mutant
enzymes. Therefore, in order to alter the surface chemistry of 
GUS, either to avoid or to accommodate aldehyde modifications 
without loss of enzyme activity, we employed a random mutation- 



al approach similar to those previously proven useful for altering 
enzyme substrate specificity 13 or thermostability 14 . 
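
The scale of the combinatorial problem can be estimated with a short calculation. The sketch below counts only which lysines are changed, that is, the non-empty subsets of positions, and ignores the choice of replacement residue, so it is a lower bound on the number of mutants. The function name is illustrative, and treating the six conserved lysines as fixed is an assumption made here for comparison.

```python
TOTAL_LYSINES = 27      # lysine residues in E. coli GUS (from the text)
CONSERVED_LYSINES = 6   # conserved across the homologs (from the text)


def nonempty_subsets(n_sites: int) -> int:
    """Number of ways to choose at least one of n_sites positions."""
    return 2 ** n_sites - 1


# Every possible combination of lysines to mutate.
all_combinations = nonempty_subsets(TOTAL_LYSINES)

# Combinations if the conserved lysines are left untouched (assumption).
variable_only = nonempty_subsets(TOTAL_LYSINES - CONSERVED_LYSINES)

print(f"All 27 lysines:        {all_combinations:,} combinations")
print(f"21 non-conserved only: {variable_only:,} combinations")
```

Even the restricted count runs to millions of constructs before any choice of replacement amino acid is considered, which is the motivation for a random mutagenesis and screening strategy.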

Results 

Directed evolution of glutaraldehyde-resistant variants. Random 
mutations were initially introduced into the gusA structural gene by 
mutagenic PCR 15. Mutated PCR products were ligated into the
expression vector gusA-pBSA and transformed into E. coli. When
the library was induced on plates containing the chromogenic GUS
substrate, 5-bromo-4-chloro-3-indolyl-β-D-glucuronide (X-gluc),
approximately 80% of the colonies were visibly less green than control
colonies expressing only the chromosomal gene (see
Experimental protocol). β-Glucuronidase functions as a tetramer 16,
so it was likely that many of the mutations in the highly expressed, 
plasmid-borne library had a dominant negative effect on the func- 
tion of the chromosomal gene. This did not deter us from utilizing 
this library for screening experiments, since successive rounds of 
DNA shuffling should efficiently select against neutral or deleteri- 
ous mutations 13 . 

Nine thousand replica-plated colonies, each expressing a ran- 
domly mutated gusA gene, were exposed to buffer containing 0.2% 
glutaraldehyde for 20 min. The colony remnants were then incubat- 
ed in buffer containing X-gluc and the histochemical indicator, 
nitroblue tetrazolium (NBT) (Fig. 1). The catalytic activity of the 
wild-type enzyme is greatly diminished under those conditions, 
indicating that the glutaraldehyde disrupts the cell membranes and 
covalently modifies many intracellular proteins, including GUS (cf.
Fig. 2A and B). Of all the variants examined, only 10 colonies reproducibly
exhibited greater catalytic activity than control colonies
expressing wild-type gusA (Fig. 2B). The corresponding colonies on 
master plates were isolated, and their expression vectors purified

Figure 1. Screen for glutaraldehyde-resistant β-glucuronidase (GUS)
function (sequence from top left). A library of randomly mutated
β-glucuronidase genes (gusA) is subcloned into an inducible
expression vector and transformed into Escherichia coli. The
resulting colonies are transferred to a nitrocellulose filter, which is
overlaid upon an agarose plate containing an inducer and incubated
for 12-24 h at 37°C. The filter-bound colonies are incubated in buffer
containing glutaraldehyde, then transferred to buffer containing the
histochemical indicators of β-glucuronidase, X-gluc and NBT. The
brief incubation in 4 M guanidine HCl (Gdn-HCl) arrests color
development. Colonies that retain GUS activity are isolated from the
original plate and randomly recombined by DNA shuffling for the next
round of screening.



and pooled. The variant gusA genes were amplified using the PCR 
and randomly recombined by DNA shuffling 17 . We then screened 
6,000 random recombinants in a second round for variants that 
retained catalytic activity after a 20 min incubation in 1.0% glu- 
taraldehyde. Nine colonies contained variants that exhibited greater 
residual catalytic activity than the most resistant clone isolated in 
the first round of screening (Fig. 2C). These variants were again
pooled, amplified, and randomly recombined. Then, 6,000 recom- 
binants were screened in a third round for variants that retained 
catalytic activity after a 20 min incubation in 3.5% glutaraldehyde. 
Again, nine improved clones were isolated, one of which (GUS AR ) 
reproducibly showed the greatest activity under the most stringent 
conditions (Fig. 2D). 
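
The three screening rounds can be summarized numerically. The following minimal Python sketch only tabulates the colony counts, glutaraldehyde concentrations, and numbers of improved clones quoted above, and derives a hit rate for each round. The names `ROUNDS`, `hit_rate_percent`, and `stringency_fold` are illustrative assumptions and are not part of the original protocol.

```python
# Screening rounds as quoted in the text: colonies screened, glutaraldehyde
# concentration (percent), and improved clones recovered.
ROUNDS = [
    {"round": 1, "screened": 9000, "glutaraldehyde_pct": 0.2, "hits": 10},
    {"round": 2, "screened": 6000, "glutaraldehyde_pct": 1.0, "hits": 9},
    {"round": 3, "screened": 6000, "glutaraldehyde_pct": 3.5, "hits": 9},
]


def hit_rate_percent(screened, hits):
    """Percentage of screened colonies that were scored as improved."""
    if screened <= 0:
        raise ValueError("screened must be positive")
    return 100.0 * hits / screened


def stringency_fold(rounds):
    """Fixative concentration of each round relative to the first round."""
    base = rounds[0]["glutaraldehyde_pct"]
    return [r["glutaraldehyde_pct"] / base for r in rounds]


if __name__ == "__main__":
    for r, fold in zip(ROUNDS, stringency_fold(ROUNDS)):
        rate = hit_rate_percent(r["screened"], r["hits"])
        print(f"round {r['round']}: {rate:.2f}% hits, {fold:.1f}x fixative")
```

In every round only a small fraction of a percent of colonies passed the screen, while the fixative concentration rose 17.5-fold between the first and third rounds.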

In vitro characterization of GUS AR . To determine whether the 
colony-lift assay significantly influenced the apparent fixative- 
resistant phenotype of GUS AR , activity assays were also carried out 
in cell extracts. The GUS-deficient strain, pREP4/GMS407, was 
transformed with vectors that expressed either wild-type gusA, the 
GUS AR variant, or the lacZ α-fragment. Cell extracts from induced
cultures were treated with glutaraldehyde or formaldehyde for 20
min at 23°C, and diluted 100-fold in buffer containing saturating
concentrations of a GUS substrate, p-nitrophenyl β-D-glucuronide
(PNPG). The extracts containing wild type or GUS AR
catalyzed the hydrolysis of PNPG; no hydrolysis was detected in
the negative control extracts, in which only the lacZ α-fragment
was expressed (data not shown). Treatment of the extract containing
wild-type GUS with only 0.04% glutaraldehyde for 20 min at
23°C reduced catalytic activity by 99.6 ± 0.24%. In sharp contrast,
the GUS AR extract retained 78.1 ± 0.69% of its activity after treatment
with a fivefold higher (0.2%) concentration of glutaraldehyde
(Fig. 3A).

Figure 2. Detection of glutaraldehyde-resistant GUS activity.
Escherichia coli cells transformed with vectors expressing the wild-
type (A, B left), a pool of the ten glutaraldehyde-resistant variants
from round 1 (B right, C left) or the most resistant variants from
rounds 2 (C right, D left), or 3 (D right) were streaked onto
noninducing plates. The colonies were propagated, induced, and
treated for 20 min with the indicated concentrations of
glutaraldehyde, then reacted with X-gluc and NBT, as described in
the legend to Figure 1.



Extracts were also separately treated with formaldehyde to assess
whether the fixative-resistant phenotype was specific to glutaraldehyde.
Again, the GUS AR variant exhibited much greater resistance to
the fixative than did the wild-type GUS. The wild-type extract
retained only 4.6 ± 0.05% of its catalytic activity after treatment
with 0.08% formaldehyde; the GUS AR extract retained 62.4 ± 0.40%
activity after incubation with 0.4% formaldehyde (Fig. 3B). To
determine how sequence and chemical modifications may have
influenced GUS activity, we conducted kinetic studies of the mutant
enzyme (Fig. 3C). The wild-type and evolved gus genes were subcloned,
expressed as fusion proteins with N-terminal hexahistidine
tags, and purified by immobilized metal ion adsorption chromatography.
Purified enzymes were assayed with varying concentrations
of PNPG (Fig. 3C). The kinetic parameters of the wild-type enzyme
(Km for the complex with PNPG = 110 ± 2.9 µM; kcat = 920 ± 7.3 s⁻¹)
were very similar to those of the GUS AR variant (Km = 150 ± 3.9 µM;
kcat = 750 ± 5.8 s⁻¹). The kinetic parameters of wild-type and evolved
enzymes were also determined following reaction with a sublethal
concentration (0.04%) of formaldehyde for 20 min at 23°C.
Formaldehyde had a larger effect on the turnover number (250 ± 17
s⁻¹ for the partially modified wild type, 650 ± 4.7 s⁻¹ for the modified
GUS AR variant) than on the Michaelis constants (99 ± 6.7 µM and
170 ± 3.7 µM for the modified wild-type and GUS AR enzymes,

Figure 3. GUS catalytic activity as a function of aldehyde concentration. Escherichia coli cell extracts containing the wild-type (circles) or evolved
GUS AR (squares) β-glucuronidase were incubated for 20 min at 23°C in buffer containing the indicated concentrations of glutaraldehyde (A) or
formaldehyde (B). The protein was then diluted 100-fold into buffer containing the chromogenic GUS substrate, p-nitrophenyl-β-D-glucuronide
(PNPG). Hydrolysis of the substrate was followed at 405 nm using a spectrophotometer (see Experimental protocol). Each point represents the
average of three initial velocity values; the points subsume the error bars. The control extract from an isogenic strain not expressing GUS does not
have detectable activity (not shown). (C) Purified wild-type (circles) or evolved GUS AR (squares) enzymes were incubated for 20 min at 23°C in buffer
containing 0% (empty symbols) or 0.04% (filled symbols) formaldehyde, then reacted with the indicated concentrations of PNPG. The initial velocity
values were fitted to the Michaelis-Menten equation (lines); the derived kinetic parameters are presented in the text.



respectively). These results in conjunction with the cell extract 
data show that GUS AR is expressed at lower levels than the wild 
type, but is inherently more aldehyde resistant. 
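
The comparison of turnover numbers and Michaelis constants can be worked through explicitly. The sketch below uses only the mean kinetic parameters quoted in the text (the reported uncertainties are ignored for simplicity) and computes the catalytic efficiency kcat/Km and the fraction of each parameter retained after the 0.04% formaldehyde treatment. The dictionary layout and function names are assumptions made for illustration.

```python
# Mean kinetic parameters quoted in the text (Km in micromolar, kcat per second).
KINETICS = {
    "wild type": {
        "untreated": {"km_uM": 110.0, "kcat": 920.0},
        "formaldehyde": {"km_uM": 99.0, "kcat": 250.0},
    },
    "GUS AR": {
        "untreated": {"km_uM": 150.0, "kcat": 750.0},
        "formaldehyde": {"km_uM": 170.0, "kcat": 650.0},
    },
}


def catalytic_efficiency(params):
    """Return kcat/Km in units of 1/(micromolar * second)."""
    return params["kcat"] / params["km_uM"]


def fraction_retained(enzyme, key):
    """Ratio of the formaldehyde-treated value to the untreated value."""
    entry = KINETICS[enzyme]
    return entry["formaldehyde"][key] / entry["untreated"][key]


if __name__ == "__main__":
    for enzyme in KINETICS:
        kcat_kept = fraction_retained(enzyme, "kcat")
        km_ratio = fraction_retained(enzyme, "km_uM")
        print(f"{enzyme}: kcat retained {kcat_kept:.2f}, Km ratio {km_ratio:.2f}")
```

With these values the wild-type enzyme keeps roughly 27% of its turnover number after treatment, whereas GUS AR keeps roughly 87%, while the Michaelis constants change by only about 10 to 13% in either case.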

Sequence and structural mapping of the evolved GUS AR vari- 
ant. Upon sequencing, the evolved gusA gene was found to contain 
the following amino acid substitutions: N66D, D151N, A219V,
I396T, T480A, Q498R, D508E and K567R, as well as six silent
mutations. Only one of these changes, the D508E mutation,
results from a transversion, indicating a strong transition bias in
our random mutagenesis methods. The amino acid sequences of
the E. coli and human GUS proteins are 50% identical 18 and could
be readily aligned by the algorithm devised by Needleman and
Wunsch 19 using the program GAP 4.0 (Genetics Computer
Group, Madison, WI). Both proteins are tetramers 16,18 and are virtually
identical in substrate specificity 1. The positions of the loci
that were altered in the evolved E. coli enzyme could thus be tenta- 
tively mapped onto the crystal structure of the human GUS pro- 
tein 18 (Fig. 4). 
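
For readers unfamiliar with the alignment step, the following is a minimal sketch of Needleman-Wunsch global alignment and of a percent-identity calculation over the aligned columns. It is not the GAP 4.0 implementation: the simple match, mismatch, and linear gap scores are assumptions chosen for brevity (GAP uses a substitution matrix and separate gap creation and extension penalties), and the short sequences in the usage example are toy strings, not GUS sequences.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align strings a and b; return the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner, preferring diagonal moves.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a, aligned_b):
    """Identical, non-gap columns as a percentage of alignment length."""
    same = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * same / len(aligned_a)


if __name__ == "__main__":
    top, bottom = needleman_wunsch("ACGT", "AGT")
    print(top, bottom, percent_identity(top, bottom))
```

Note that percent identity depends on the denominator chosen (alignment length here), which is one reason different tools can report slightly different values for the same pair of sequences.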

The 10 gusA mutants isolated in the first round of screening 
were sequenced; mutations were found at a frequency of three per 
1.8 kb. Seven of the first-round variants contained amino acid 
substitutions (K567R, T480A, D508E or N66D/D151N) that were
subsequently found in the most active third-round GUS AR variant. 
It is instructive that many of the single substitutions that confer 
modest resistance to aldehyde modification can interact additively 
or synergistically to confer robust resistance to aldehyde modifica- 
tion. 
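
The reported frequency of three mutations per 1.8 kb can be converted into a per-base rate, and, under an added assumption, into an expected distribution of mutation counts per gene. The sketch below assumes that mutations are independent and Poisson distributed; that assumption is ours, not the authors', and the sequenced clones were selected by the screen, so the figures are illustrative only.

```python
import math


def per_base_rate(mutations, length_bp):
    """Average number of mutations per base pair."""
    if length_bp <= 0:
        raise ValueError("length_bp must be positive")
    return mutations / length_bp


def poisson_pmf(k, mean):
    """Probability of exactly k events for a Poisson variable with this mean."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


if __name__ == "__main__":
    rate = per_base_rate(3, 1800)  # three mutations per 1.8 kb, from the text
    print(f"per-base rate: {rate:.5f}")
    for k in range(6):
        print(k, round(poisson_pmf(k, 3.0), 3))
```

Under this assumption, about 5% of genes in such a library would carry no mutation at all, and most would carry between one and five.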

GUS AR as a lineage tracer in Xenopus embryos. The N358S 
mutant of GUS is a commercially available and commonly used 
reporter gene in plants 4 . The N358S mutation eliminates a cryptic 
glycosylation site, and should not affect its function in the cyto- 
plasm; we chose this construct because it also contained the 
upstream sequences necessary for expression in eukaryotic cells 20 . 
In order to determine if the fixative-resistant GUS might also 
prove useful in other model organisms, transcripts encoding
GUS AR and N358S GUS were microinjected into 16- to 32-cell
stage Xenopus embryos. Two days later, the embryos were fixed in 
3.7% formaldehyde for 20 min and stained using a standard pro- 
tocol for the detection of lacZ expression, except that X-gluc was 
substituted for X-gal. The descendants of cells injected with the 
wild-type-like N358S GUS mRNA did not change color (Fig. 5A). 
Endogenous GUS activity was apparently also abrogated by the 20 
min incubation in 3.7% formaldehyde. In contrast, the descendants
of cells injected with the GUS AR mRNA turned bright blue-green
(Fig. 5B).

In order to determine if multiple reporters might be used in 
tandem for lineage analysis, mRNAs encoding either N358S or the 
GUS AR were co-injected into embryonic cells along with mRNA 
encoding lacZ. When the embryos were first stained with X-gluc, 
again only the cells that inherited the GUS AR mRNA turned blue- 
green (Fig. 6A and B). The embryos were subsequently stained 
with rose-gal, a β-galactosidase substrate that forms a red precipitate.
In embryos that received either the N358S or GUS AR, some
cells were colored red, indicating the inheritance of lacZ. However, 
in embryos that received GUS AR some cells or patches were also 
purple, indicating the co-inheritance of the GUS AR and lacZ (Fig. 
6C and D). The lacZ mRNA in this experiment also served as an 
internal control that demonstrated that RNAs were entering cells 
and surviving until fixation. 

Discussion 

Mechanisms of aldehyde resistance. The E. coli GUS protein con- 
tains 27 lysine residues, six of which are conserved among the 
sequenced GUS genes. Although the fraction of lysine residues 
that are modified by aldehydes is unknown, wild-type GUS activi- 
ty is quite susceptible to even low levels of fixatives. For example, 
the catalytic activities of the wild-type (Fig. 3B) and N358S GUS 
(Figs 5A and 6A) are inactivated by <3.7% formaldehyde, while
β-galactosidase, which contains 20 lysine residues, is not (Fig. 6).
Taken together, these results suggest that one or more of the GUS

Figure 4. Homology mapping of amino acid substitutions that confer
aldehyde resistance. The Cα trace of a subunit from the crystal
structure of human GUS 18 is shown. The amino acid sequences of the E.
coli and human GUS proteins were aligned using the application GAP
4.0 (Genetics Computer Group) and were found to be 48.5% identical.
The positions of lysine residues are darkened, and the conserved
lysines are labeled. The positions of amino acid substitutions in the
evolved GUS AR E. coli protein are shown as balls.

lysine residues is either itself critical for activity or presents a con- 
jugation site that leads to functional disruption. However, identi- 
fying which of the many lysine residues in GUS were responsible 
for inhibition by fixatives would have been a daunting task. 
Instead, we relied on random mutagenesis to identify GUS variants
with catalytic activity resistant to aldehydes. Following three rounds
of screening and amplification, we isolated an octuple-mutant
GUS AR with catalytic activity resistant to roughly 80-fold higher levels
of glutaraldehyde than the wild-type activity (Fig. 3A).

Surprisingly, only one of the amino acid substitutions, K567R, 
in the evolved GUS AR occurred at a lysine residue. Since AAA or 
AAG encodes lysine, the apparent transition bias in our random- 
mutagenesis method and the size of our initial library provided 
ample opportunities for each lysine to conservatively mutate into 
arginine (AGA or AGG). While it is possible that mutation of this 
single lysine was largely responsible for protection against aldehy- 
des, this explanation is unlikely. Three of the 10 clones isolated in 
the first round of screening contain the K567R sequence substitu- 
tion, but none of these first-round isolates are as resistant as any 
of the second-round isolates (Fig. 2C). The aldehyde resistance of 
GUS progressively increased over three rounds of screening and 
selection, and the final product had accumulated seven additional 
amino acid substitutions. The finding that amino acid substitutions
that modulate protein function are dispersed in the primary
and tertiary structure of GUS is congruent with previous attempts
to evolutionarily engineer the physical and kinetic parameters of
enzymes. Experiments that directed an increase in the catalytic
activity of a p-nitrobenzyl esterase in organic solvents yielded
multiple sequence substitutions scattered throughout the tertiary
structure 21. Site-directed mutation studies of T4 lysozyme have
shown that stabilizing amino acid changes, which occur in the 
core of that enzyme, are additive in effect 22.

Interestingly, the seven non-lysine amino acid substitutions
mapped onto the surface of the protein near lysine residues (Fig.
4). Protein structure is more highly conserved than protein
sequence 23, and since the primary sequences of the E. coli and
human GUS enzymes are quite similar (48.5% identity 5), it can be
conservatively assumed that their tertiary structures also align
well. Based on this assumption, we can advance hypotheses
regarding the contributions of individual amino acid substitutions
to aldehyde resistance. For example, Lys568 (E. coli numbering)
is conserved among the sequenced GUS genes, and is in the
active site 18. The Cα-Cα distance from Lys568 to the D508E substitution
is 3.97 Å (Fig. 4). Since the lysine side chain is 7 Å in
length, this adjacent sequence substitution might raise the pKa of
the epsilon amino group of Lys568, thereby reducing its reactivity
with aldehydes. Similarly, the K567R substitution already mentioned
is within 3.81 Å of the active-site lysine, and mutation to
arginine may prevent modification that could sterically interfere
with substrate binding or catalysis.
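
The codon-level reasoning behind the K567R and D508E substitutions can be made concrete. The sketch below classifies a single-nucleotide codon change as a transition (purine to purine or pyrimidine to pyrimidine) or a transversion (purine to pyrimidine or the reverse). The example codons come from the standard genetic code (Lys AAA/AAG, Arg AGA/AGG, Asp GAT/GAC, Glu GAA/GAG); the codons actually present in gusA are not given in the text, so the specific pairs shown are illustrative assumptions.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref, alt):
    """Return 'transition' or 'transversion' for a single-base change."""
    ref, alt = ref.upper(), alt.upper()
    valid = PURINES | PYRIMIDINES
    if ref not in valid or alt not in valid:
        raise ValueError("bases must be A, C, G or T")
    if ref == alt:
        raise ValueError("ref and alt are identical")
    same_class = (ref in PURINES) == (alt in PURINES)
    return "transition" if same_class else "transversion"


def classify_codon_change(ref_codon, alt_codon):
    """Return (codon position 1-3, class) for codons differing at one base."""
    if len(ref_codon) != 3 or len(alt_codon) != 3:
        raise ValueError("codons must have three bases")
    diffs = [(i, r, a)
             for i, (r, a) in enumerate(zip(ref_codon, alt_codon)) if r != a]
    if len(diffs) != 1:
        raise ValueError("codons must differ at exactly one position")
    pos, ref, alt = diffs[0]
    return pos + 1, classify_substitution(ref, alt)


if __name__ == "__main__":
    print(classify_codon_change("AAA", "AGA"))  # Lys to Arg
    print(classify_codon_change("GAT", "GAA"))  # Asp to Glu
```

Any single-base Lys to Arg change between these codons is an A to G transition at the second position, whereas any single-base Asp to Glu change swaps a third-position pyrimidine for a purine and is therefore a transversion, in agreement with the observation that D508E was the only transversion.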

Similarly, β-glucuronidase is active only as a tetramer 16,18, and
lysines play a key role in its quaternary structure. To the extent
that modifications of interfacial lysines disrupt quaternary structure
and enzymatic function, adjacent amino acid substitutions
could render these lysines less reactive. In this regard, the loop
containing D151N and three lysines is <5 Å away from the α-helix
of the adjacent subunit containing T480A and three other lysines.
These amino acid substitutions could also prevent structural and 
functional disruption by independently increasing the affinity 
between the subunits. For example, the A219V substitution also 
maps to the other interface, although it is not immediately adja- 
cent to any lysines. 

Overall, it appears the surface chemistry of the enzyme has 
coordinately evolved either to cause lysines to be less reactive or to 
functionally accommodate covalent modification of lysines. Our 
results suggest that there may be multiple possible routes by which 
proteins could be adapted to function in a wide variety of fixatives 
or solvent systems. More importantly, they suggest a way of aug- 
menting protein chemistry by introducing amino acids with novel 
surface conjugates. 




Figure 5. Expression of GUS AR in Xenopus embryos. Embryos at the
16- to 32-cell stage were injected with 1 ng of mRNA encoding N358S
(A) or GUS AR (B) and fixed two days later in 3.7% formaldehyde for 20
min. GUS activity was detected using the chromogenic substrate
X-gluc (light blue/green). The reddish-purple color of the cement gland
of the embryo shown in (A) is from a natural pigment, and the blue
color of the embryo shown in (B) is from GUS AR activity.

Figure 6. Multiple marker staining of Xenopus embryos. Embryos were co-injected with 0.5
ng of lacZ mRNA and 1 ng of either N358S GUS (A, C) or GUS AR (B, D), and subsequently fixed
in 3.7% formaldehyde for 20 min. Following fixation, embryos were reacted with X-gluc (light
blue/green in all frames). As in Figure 5, embryos injected with mRNA encoding GUS AR (B)
stained much more intensely than those injected with N358S GUS mRNA (A). All embryos
were subsequently rinsed free of X-gluc and stained with the chromogenic substrate for
β-galactosidase, rose-gal (red in C and D).



GUS AR as a universal reporter gene. Since most naturally 
occurring β-glucuronidases are likely to be fixative-labile, the fixative-resistant
GUS AR we have isolated should prove useful for
expanding the range and power of GUS staining techniques. In 
addition, it should be possible to develop methods for following 
multiple genes or cell lineages in parallel. Such methods generally 
rely on protocols in which fixed tissues are reacted with antibodies 
conjugated to dyes or reporter enzymes (for example, see ref. 24). 

Reporter genes are very commonly used in Xenopus as cell lineage
tracers, and have proved important for gene expression studies
in developing embryos 25,26. Following mRNA microinjection,
the fixative-resistant GUS could be specifically followed in
Xenopus relative to both background activity and the wild-type
reporter. Moreover, a lineage trace in tandem with β-galactosidase
demonstrated the use of GUS in a multiple-reporter format. These
experiments pave the way for the practical development of two-enzyme
reporter systems, and could potentially be combined with
a β-lactamase reporter system developed by Raz et al. 27 to create
three-enzyme reporter systems.

The results in Xenopus embryos are notable in that no special
precautions were taken to enhance gene expression or enzymatic
activity. In contrast to transformed reporter constructs,
microinjected reporter mRNAs do not replicate and their
dosage progressively decreases as messages are segregated or bro- 
ken down. Further, no attempt was made to increase the signal 
intensity of GUS AR by fusing it to a nuclear localization signal, as 
was the case for the lacZ control. Nor were fluorescent or other 
highly sensitive commercially available GUS substrates 3 utilized. 
In short, the evolved enzyme is itself robust enough so that new 
staining techniques can easily be adapted from extant methods. 

Experimental protocol 

Materials. DNA-modifying enzymes, including restriction enzymes and
Vent polymerase, were purchased from New England Biolabs (Beverly,
MA). Deoxyribonuclease I was from GIBCO-BRL (Gaithersburg, MD).
Taq polymerase was expressed and purified as
described by Grimm and Arbuthnot 28. DNA
sequencing kits were from Perkin-Elmer/Applied
Biosystems (Foster City, CA). Cloning vector
pGEM-5 was from Promega (Madison, WI),
pBluescript II SK(+) was from Stratagene (La Jolla,
CA), and the regulatory vector pREP4 was from
Qiagen (Chatsworth, CA). pGUS N358S was from
Clontech (Palo Alto, CA), and pET28a(+) from
Novagen (Madison, WI). DNA purification columns
were purchased from Qiagen (Chatsworth, CA). X-gluc
was from Gold Biotechnology (St. Louis, MO)
and Butterfly nitrocellulose membranes from
Schleicher and Schuell (Keene, NH). The mMessage
mMachine SP6 in vitro mRNA transcription kit was
from Ambion (Austin, TX). MicroSpin G-25
Sephadex spin columns were from Pharmacia
Biotech (Piscataway, NJ). Escherichia coli strain
INVαF' was from Invitrogen (Carlsbad, CA), W3110
(ATCC No. 27325) from the ATCC (Rockville, MD),
GMS407 from the E. coli Genetic Stock Center (New
Haven, CT), and BL21(DE3)pLysS from Novagen.
Other chemicals, including glutaraldehyde, PNPG
and NBT, were from Sigma Chemicals (St. Louis,
MO).

Cloning of gusA. The E. coli gusA gene was amplified
from W3110 cells using Vent polymerase and
the primers 5'-CCGGATCCTCTAGAGATGTTACGTCCTGTAGAAACC-3'
and 5'-GCGAATTCTGCAGTCATTGTTTGCCTCCCTGCT-3'
(containing XbaI and EcoRI sites, TCTAGA and GAATTC, respectively). The PCR
product was blunt-end ligated into the EcoRV site of
pGEM-5 by standard methods 29. Escherichia coli
INVαF' cells were transformed by the method
described by Inoue et al. 30. The gusA gene was subcloned into pBluescript
II SK(+) using restriction endonucleases XbaI and EcoRI. The nucleotides
encoding the lacZ α-fragment that would normally have been located
between the ribosome binding site and the gusA start codon were deleted by
amplifying the remainder of the plasmid using primers
5'-CCGGATCCTCTAGAGATGTTACGTCCTGTAGAAACC-3' and
5'-CGTCTAGAAGCTGTTTCCTGTGTGAAATTG-3', digesting with XbaI, ligating, and
transforming pREP4/INVαF' (see below). The resultant construct placed the gusA
gene under direct control of the lac promoter. The GUS expression vector was
named gusA-pBSA.

Library construction and screening. For the first round of screening, random
mutations were introduced into the cDNA by mutagenic PCR 15 using
primers 5'-CCCAGTCACGACGTTGTAAAACGACG-3' and
5'-ATGCTTCCGGCTCGTATGTTGTGTGG-3', which anneal to the pBluescript II SK(+)
vector outside of the boundaries of the gusA insert. The amplification reaction
was carried out with 100 nM primers, 60 mM Tris-HCl pH 8.5, 15 mM
(NH4)2SO4, 3.2 mM MgCl2, 0.125 mM MnCl2, 0.2 mM dGTP, 0.2 mM dATP,
0.4 mM dTTP, 0.4 mM dCTP, for 35 cycles of 94°C x 30 s, 72°C x 2 min. The
gusA-pBSA plasmid library was transformed into E. coli INVαF' cells harboring
the lacI expression vector pREP4. The plasmid was unstable when propagated
in E. coli INVαF' without pREP4, probably because the lac repressor is
not present at high enough levels to limit expression of the gusA gene. For colony-lift
assays, gusA-pBSA/pREP4/INVαF' colonies were propagated on liquid
Luria Broth supplemented with 25 µg/ml kanamycin and 100 µg/ml ampicillin
(LB-kan/amp) + 0.4% glucose plates for 12 h at 37°C. The colonies
were adsorbed to a nitrocellulose filter and transferred colony side up to
LB-kan/amp plates containing 0.5 mM isopropyl β-D-thiogalactopyranoside
(IPTG), and induced at 37°C for 12-24 h. The nitrocellulose-bound colonies
were transferred to GUS buffer (50 mM sodium phosphate pH 7.0, 0.1%
Triton X-100, 1 mM EDTA) containing 0.2% glutaraldehyde and incubated
for 20 min at 23°C. The filters were then transferred to buffer containing 165
µg/ml X-gluc and 330 µg/ml NBT and incubated for 10-30 min. The filters
were incubated briefly in 4 M guanidine hydrochloride to arrest color development.
Those colonies on the master plate corresponding to the darkest
colony remnants on the filter were isolated and amplified.

For subsequent rounds of screening, the alleles were randomly recombined
and mutated by DNA shuffling as described by Stemmer 17. In short,
the gusA variants were PCR amplified using the same primers as in the
mutagenic PCR reactions already described, partially digested with DNase
I, and reassembled in a PCR reaction without primers. The products were
amplified in a PCR with primers, then subcloned back into gusA-pBSA for
screening. The second and third rounds were carried out in the same way,
except that 1.0% and 3.5% glutaraldehyde, respectively, were used to fix the colonies before
the incubation in X-gluc and NBT. The most resistant round 3 clone was isolated
and sequenced at the University of Texas, Institute of Cellular and
Molecular Biology Core Facility using the Applied Biosystems protocol, via the
primers originally used for mutagenic PCR and two additional internal
primers: 5'-CGCCGGGAATGGTGATTACC-3' and 5'-CTGATGGTATCGGTGTGAGCG-3'.

In vitro characterization of enzyme activity. For the preparation of lysates,
gusA-pBSA/pREP4/GMS407 cells were propagated at 37°C in LB-kan/amp.
The gusA gene was induced by the addition of 0.5 mM IPTG to mid-log (OD600
= 0.3) cultures, and the induced cultures were grown overnight. Cells were
centrifuged, resuspended in distilled water, centrifuged again, and resuspended in
GUS buffer. Cells were lysed with the addition of 10 mM EDTA and 1 mg/ml
chicken lysozyme. The insoluble fraction was centrifuged down, and the aldehyde
resistance of the GUS in the supernatant was determined as follows.
Glutaraldehyde or formaldehyde was added to an aliquot of supernatant and
the mixture was incubated at 23°C for exactly 20 min. The mixture was then
diluted 1/100 into GUS buffer containing 0.5 mM PNPG. The hydrolysis of the
substrate was followed for 1 min at 23°C at 405 nm in a Shimadzu UV-1601
spectrophotometer. The absorption extinction coefficient of p-nitrophenol
under these conditions was 11.50 mM⁻¹ cm⁻¹. The initial rates of hydrolysis
were linear (data not shown).
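
The conversion from an absorbance slope to a hydrolysis rate follows from the Beer-Lambert law. The sketch below uses the extinction coefficient stated above and assumes a 1 cm path length, which the text does not specify; the function names and the absorbance slope in the usage example are hypothetical.

```python
EPSILON_PER_MM_CM = 11.50  # p-nitrophenol at 405 nm, value from the text


def rate_uM_per_min(delta_a405_per_min, path_cm=1.0, epsilon=EPSILON_PER_MM_CM):
    """Convert an absorbance slope (A405 per min) to micromolar product per min.

    Beer-Lambert: A = epsilon * c * l, so dc/dt = (dA/dt) / (epsilon * l).
    The 1 cm default path length is an assumption, not a value from the text.
    """
    if path_cm <= 0 or epsilon <= 0:
        raise ValueError("path_cm and epsilon must be positive")
    rate_mM_per_min = delta_a405_per_min / (epsilon * path_cm)
    return rate_mM_per_min * 1000.0


def percent_activity_retained(treated_rate, untreated_rate):
    """Residual activity of an aldehyde-treated sample, in percent."""
    if untreated_rate <= 0:
        raise ValueError("untreated_rate must be positive")
    return 100.0 * treated_rate / untreated_rate


if __name__ == "__main__":
    untreated = rate_uM_per_min(0.115)   # hypothetical slope, about 10 uM/min
    treated = rate_uM_per_min(0.0897)    # hypothetical slope
    print(round(untreated, 2), round(percent_activity_retained(treated, untreated), 1))
```

Because both samples are diluted identically before the assay, the dilution factor cancels when residual activity is expressed as a percentage of the untreated control.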

To generate purified GUS enzymes, the wild-type and evolved gusA genes
were amplified by PCR with the primers
5'-GCTCTAGAGCATATGTTACGTCCTGTAGAAACC-3' and
5'-GCGAATTCTGCAGTCATTGTTTGCCTCCCTGCT-3' and subcloned into the expression
vector pET28a(+) using the restriction enzymes NdeI and EcoRI (sites
CATATG and GAATTC in the primers). The
resultant genes were sequenced as described already to confirm that no additional
mutations had been introduced during amplification or cloning. The
expression constructs were transformed into BL21(DE3)/pLysS. The transformed
strains were propagated and induced, and the proteins purified by
nickel chelate chromatography, as suggested by Novagen (Madison, WI). The
protein preparations were judged to be >99% pure following SDS-PAGE and
Coomassie Blue staining (data not shown). Purified protein concentrations
were determined via Bradford protein assays (Bio-Rad, Hercules, CA).

A 10 pmol quantity of purified GUS protein was preincubated for 20 min at
23°C in 10 µl of GUS buffer (1 µM) containing 0% or 0.04% formaldehyde.
Then, 5 µl of protein solution were added to 1 ml of buffer (5 nM) containing
varying concentrations of PNPG, and the initial velocity of each reaction was
determined as already described. The kinetic parameters of the wild-type and
mutant enzymes were calculated by fitting the initial velocity values to the
Michaelis-Menten equation using the application Kaleidagraph 3.0.5
(Abelbeck Software, Reading, PA).
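
As an illustration of how kinetic parameters can be extracted from initial velocities, the sketch below estimates Vmax and Km with a Hanes-Woolf linearization (least squares on s/v against s). This is a simpler, swapped-in technique and not the nonlinear fit performed in Kaleidagraph; linearized fits weight errors differently from a direct nonlinear fit. The substrate concentrations and the noise-free velocities are hypothetical, generated from the wild-type parameters quoted in the Results, and the 5 nM enzyme concentration is taken from the protocol above.

```python
def michaelis_menten(s_uM, vmax, km_uM):
    """Initial velocity predicted by the Michaelis-Menten equation."""
    return vmax * s_uM / (km_uM + s_uM)


def hanes_woolf_fit(substrate_uM, velocity):
    """Estimate (Vmax, Km) by least squares on s/v = s/Vmax + Km/Vmax."""
    if len(substrate_uM) != len(velocity) or len(substrate_uM) < 2:
        raise ValueError("need at least two paired observations")
    xs = list(substrate_uM)
    ys = [s / v for s, v in zip(substrate_uM, velocity)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                    # equals 1 / Vmax
    intercept = mean_y - slope * mean_x  # equals Km / Vmax
    vmax = 1.0 / slope
    return vmax, intercept * vmax


ENZYME_UM = 0.005  # 5 nM enzyme in the assay, as stated in the text

# Hypothetical, noise-free data generated from Km = 110 uM, kcat = 920 per s.
substrate = [25.0, 50.0, 100.0, 200.0, 400.0, 800.0]
velocity = [michaelis_menten(s, 920.0 * ENZYME_UM, 110.0) for s in substrate]

vmax_fit, km_fit = hanes_woolf_fit(substrate, velocity)
kcat_fit = vmax_fit / ENZYME_UM  # turnover number in 1/s

if __name__ == "__main__":
    print(f"Km = {km_fit:.1f} uM, kcat = {kcat_fit:.1f} per s")
```

The turnover number follows from dividing the fitted Vmax by the enzyme concentration, which is why the accurate determination of protein concentration matters for the reported kcat values.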

Expression of gusA in Xenopus embryos. The GUS AR gene was subcloned
into pGUS N358S; this placed the gene downstream of a Kozak sequence 20, so
that its transcript could be recognized by eukaryotic translation systems.
N358S GUS and GUS AR were subcloned into the Xenopus expression vector
p64TS. This plasmid provides in vitro-transcribed mRNAs with Xenopus globin
5'- and 3'-untranslated regions and greatly increases the amount of protein
translated from the mRNA 31. Capped mRNA was produced by in vitro transcription 32
of the clones already described using the Ambion mMessage
mMachine SP6 protocol. In vitro transcriptions were also treated with DNase I,
and the mRNA was purified using a Sephadex G-25 spin column to minimize
nonspecific toxicity effects. Purified mRNAs were resuspended in sterile water
for injections.

Female adult Xenopus were induced to ovulate with human chorionic
gonadotropin, and eggs were fertilized in vitro. Embryos were dejellied in 3%
cysteine solution and washed in 0.2x MMR 33. Embryos were then reared at
13-18°C in 0.2x MMR. Microinjections were performed as described 34.
Embryos were fixed in MEMFA (0.1 M MOPS, pH 7.4 / 2 mM EGTA / 1 mM
MgSO4 / 3.7% formaldehyde) for 20 min, and embryos were washed 5 x 5 min
in 1x PBS. GUS activity was detected using 1 mg/ml X-gluc in a solution of 1x
PBS / 20 mM potassium ferricyanide / 20 mM potassium ferrocyanide / 2 mM
MgCl2 / 0.02% NP-40 at 37°C for 2 h. β-Galactosidase was detected using the
same buffer but substituting rose-gal for X-gluc. Injection experiments repeated
on different days with different preparations of mRNA gave similar results
(data not shown).